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Hepatitis C Virus Cell Culture System 



5 ABSTRACT 

The hepatitis C virus (HCV) cell culture system according 
to the invention consists of human hepatoma cells, which 
are transfected with a HCV-RNA construct, that comprises 
10 the HCV specific RNA segments 5' NTR, NS3, NS4A, NS4B, 
NS5A, NS5B, and 3' NTR as well as a minimum of one marker 
gene for selection (selection gene) . 
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Hepat:i-b±s C Virus Cell Ciilt:ure Syst^em 

DESCRIPTION 

5 The invention relates to a hepatitis C virus (HCV) cell 
culture system, which comprises mainly eukaryotic cells 
containing transfected HCV specific genetic material^ which 
means they are transfected with HCV specific genetic 
material - 

10 

The hepatitis C virus (HCV) is one of the main causes 
worldwide of chronic and sporadic liver diseases. The 
history of most HCV infections does not involve any obvious 
clinical signs, but 80 - 90 % of the infected people become 

15 chronic carriers of the virus and 50 % of these chronic 
carriers of the virus develop chronic hepatitis with 
different degrees of severity. Approx. 20 % of the 
chronically infected develop a cirrhosis of the liver over 
10 to 20 years, based on what a primary hepatocellular 

20 carcinoma can develop. Nowadays chronic hepatitis C is the 
main indication for liver transplantation. A specific 
therapy does not exist until now. The only therapy 
currently available is high-dose administration of 
Interferon alpha or a combination of Interferon alpha and 

25 the purine nucleoside analogue Ribavirin. However, only 
approx. 60 % of all treated persons respond to this therapy 
and with these, a new viraemia occurs in more than half of 
all cases after the discontinuation of the treatment. 

30 Due to the high prevalence, especially in industrialized 
countries, the serious effects of chronic infections and 
the non-existence of a specific therapy, the development of 
a HCV specific chemotherapy is an important goal of 
pharmaceutical research and development. The main problem 

35 lies in the previous lack of a suitable cell culture 
system, which enables the study of virus replication and 
pathogenesis in eukaryotic cells. 
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Due to the small amount of virus in blood or tissue^, the 
lack of suitable cell culture systems or animal models (the 
chimpanzee is still the only possible experimental animal) 
5 as well as the lack of efficient systems for producing 
virus-like particles, it was not possible up to now, to 
analyze the molecular composition of the HCV particle in- 
depth and to solve it. The information currently available 
can be summarized as follows: HCV is an enveloped plus- 

10 strand RNA virus with a particle diameter of 50 - 60 nm and 
a medium density of 1.03 - 1.1 g/ml. It was molecularly 
cloned and characterized for the first time in 1989 (Choo 
et al., 1989, Science, 244, 359 - 362). The HCV-RNA has a 
length of approx. 9.6 kb (== 9600 nucleotides), a positive 

15 polarity and comprises one open reading frame (ORF) , which 
encodes a linear polyprotein of approx. 3010 amino acids 
(see Rice 1996, in Virology, B, N. Fields, D. M. Knipe, P. 
M. Howley, Eds. (Lippincott-Raven, Philadelphia, PA, 1996), 
vol. 1, pp. 931 - 960; Clarke 1997, J. Gen. Virol. 78, 

20 2397; and Bartenschlager 1997, Intervlrology 40, 378 and 
see Fig. 1 A). During the replication of the virus the 
polyprotein is cleaved into the mature and functionally 
active proteins by cellular and viral proteases. 

25 Within the polyprotein the proteins are arranged as follows 
(from the amino- to the carboxy terminus): Core-El-E2-p7- 
NS2-NS3-NS4A-NS4B-NS5A-NS5B. The core protein is the main 
component of the nucleocapsid. The glycoproteins El and E2 
are transmembrane proteins and the main components of the 

30 viral envelope. They probably play an important role during 
the a^ttachement of the virus to the host cell. These three 
proteins core. El, and E2 constitute the viral particle and 
are therefore called structural proteins. The function of 
the protein p7 is still not clear. The protein NS2 is 

35 probably the catalytic domain of the NS2-3 protease, which 
is responsible for the processing between the proteins NS2 
and NS3. The protein NS3 has two functions, one is a 
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protease activity in the amino terminal domain, which is 

essential for the polyprotein processing, and the other a 
NTPase/helicase function in the carboxy terminal domain, 
which is probably important during the replication of the 
5 viral RNA. The protein NS4A is a co-factor of the NS3 
protease. The function of the protein NS4B is unknown. 

The open reading frame is flanked on its 5' end by a non- 
translated region (NTR) approx . 340 nucleotides in length, 

10 which functions as the internal ribosome entry site (IRES) , 
and on its 3' end by a NTR approx- 230 nucleotides in 
length, which is most likely important for the genome 
replication. A 3' NTR such as this is the object of patent 
application PCT/US 96/14033. The structural proteins in the 

15 amino terminal quarter of the polyprotein are cleaved by 
host cell signal peptidase. The non-structural proteins 
(NS) 2 to (NS) 5B are processed by two viral enzymes, 
namely the NS2-3 and the NS3/4A protease. The NS3/4A 
protease is required for all cleavages beyond the carboxy 

20 terminus of NS3. The function of NS4B is unknown. NS5A, a 
highly phosphorylated protein, seems to be responsible for 
the Interferon resistance of various HCV genotypes (see 
Enomoto et al. 1995, J. Clin. Invest, 96, 224; Enomoto et 
al. 1996, Engl, J. Med, 334, 77; Gale Jr. et al. 1997, 

25 Virology 230, 217; Kaneko et al. 1994, Biochem. Biophys. 
Res, Commun. 205, 320; Reed et al., 1997, J. Virol. 71, 
7187), and NS5B has been identified as the RNA-dependent 
RNA polymerase. 

30 First diagnostic systems have been developed from these 
findings, which are either based on the detection of HCV 
specific antibodies in patient serum or the detection of 
HCV specific RNA using the reverse transcription polymerase 
chain reaction (RT-PCR) , and which are (must be) routinely 

35 used with all blood and blood products and/or according to 
the regulations. 
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Since the first description of the genome in 1989 several 
partial and complete sequences of the HCV have been cloned 
and characterized using the PGR method. A comparison of 
these sequences shows a high variability of the viral 
5 genome in particular in the area of the NS5B gene, which 
eventually resulted in the classification of 6 genotypes, 
which are again subdivided into the subtypes a, b, and c. 

The genomic variance is not evenly distributed over the 
10 genome. The 5'NTR and parts of the 3'NTR are highly 
conserved, while certain encoded sequences vary a lot, in 
particular the envelope proteins El and E2 . 

The cloned and characterized partial and complete sequences 
15 of the HCV genome have also been analyzed with regard to 
appropriate targets for a prospective antiviral therapy. In 
the course of this, three viral enzymes have been 
discovered, which may provide a possible target. These 
include (1) the NS3/4A protease complex, (2) the NS3 
20 Helicase and (3) the NS5B RNA-dependent RNA polymerase. The 
NS3/4A protease complex and the NS3 Helicase have already 
been crystallized and their three-dimensional structure 
determined (Kim et al., 1996, Cell, 87, 343; Yem et al., 
1998, Protein Science, 7, 837; Love et al., 1996, Cell, 87, 
25 311; Kim et al . , 1998, Structure, 6, 89; Yao et al . , 1997, 
Nature Structural Biology, 4, 463, Cho et al,, 1998, J. 
Biol. Chem. , 273, 15045). it has not been successful until 
now with the NS5B RNA-dependent RNA polymerase. 

30 Even though important targets for the development of a 
therapy for chronic HCV infection have been defined with 
these enzymes and even though a worldwide intensive search 
.for suitable inhibitors is ongoing with the aid of rational 
drug design as well as high throughput screening, the 

35 development of a therapy has one major deficiency, namely 
the lack of cell culture systems or simple animal models, 
which allow direct, reliable identification of HCV-RNA or 
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HCV antigens with simple methods which are common in the 
laboratory. The lack of these cell culture systems is also 
the main reason that to date the comprehension of HCV 
replication is still incomplete and mainly hypothetical. 

5 

Although according to the experts, a close evolutionary 
relationship exists between HCV and the flavi- and 
pestiviruses, and self-replicating RNAs have been described 
for these, which can be used for the replication in 
10 different cell lines with a relatively high yield, (see 
Khromykh et al., 1997, J. Virol. 71, 1497; Behrens et al., 
1998, J. Virol. 72, 2364; Moser et al . , 1998, J. Virol. 72, 
5318), similar experiments with HCV have not been 
successful to date. 

15 

Although it is known from different publications that cell 
lines or primary cell cultures can be infected with high 
titre patient serum containing HCV, (Lanford et al . 1994, 
Virology 202, 606; Shimizu et al. 1993, ProcBedings of the 

20 National Academy of Sciences, USA, 90, 6037 - 6041; 
Mizutani et al. 1996, Journal of Virology, 70, 7219 - 7223; 
M. Ikeda et al. 1998, Virus Res. 56, 157; Fournier et al. 
1998, J. Gen. Virol. 79, 2376 and bibliographical 
references quoted in here; Ito et al. 1996, Journal of 

25 General Virology, 11, 1043 -- 1054), these virus-infected 
cell lines or cell cultures do not allow the direct 
detection of HCV-RNA or HCV antigens. The viral RNA in 
these cells can not be detected in a Northern Blot (a 
standard method for the quantitative detection of RNA) or 

30 the viral protein in a Western Blot or with 
immunoprecipitation. It has only been possible to detect 
HCV replication with very costly and indirect methods. 
These disadvantageous facts show that obviously the 
replication in these known virus-infected cell lines or 

35 cell cultures is completely insufficient. 
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Furthermore it is known from the publications of Yoo et al. 
(1995, Journal of Virology, 69, 32 - 38) and of Dash et 
al., (1997, American Journal of Pathology, 151, 363 - 373) 
that hepatoma cell lines can be transfected with synthetic 
5 HCV-RNA, which are obtained through in vitro transcription 
of the cloned HCV genome. In both publications the authors 
started from the basic idea that the viral HCV genome is a 
plus-strand RNA functioning directly as mRNA after being 
transfected into the cell, permitting the synthesis of 

10 viral proteins in the course of the translation process, 
and so new HCV particles are (could be) formed. This viral 
replication, which means these newly formed HCV viruses and 
their RNA, have been detected through RT-PCR, However the 
published results of the RT-PCR carried out indicate, that 

15 the HCV replication in the described HCV transfected 
hepatoma cells is not particularly efficient and is not 
sufficient to measure the quality, let alone the quantity 
of the fluctuations in the replication rate after an 
targeted action with prospective antiviral treatments. 

20 Furthermore it is prior art (Yanagi et al., Proc. Natl. 
Acad. Sci. USA, 96, 2291-95, 1999), that the highly 
conserved 3' NTR is essential for the virus replication. 
This knowledge strictly contradicts the statements of Yoo 
et al. and Dash et al., who used for their experiments only 

25 HCV genomes with shorter 3' NTRs since they did not know 
the authentic 3' end of the HCV genome. 

The object of the present invention is to provide a HCV 
cell culture system, where the viral RNA self -replicates in 
30 the transfected cells with such a high efficiency that the 
quality and quantity of the fluctuations in the replication 
rate can be measured with common methodologies usually 
found in the laboratory after a targeted action with virus 
and prospective HCV specific antivirals in particular. 

35 

The solution to this problem is to provide a cell culture 
system of the prior mentioned type, where the eukaryotic 



CA 



02303526 



2000- 

7 - 



03-31 



cells are human cells, in particular hepatoma cells, which 
are preferably derived from a normal hepatoma cell line, 
but can also be obtained from an appropriate primary cell 
culture, and where the transfected HCV specific genetic 
5 material is a HCV-RNA construct, which essentially 
comprises the HCV specific RNA segments 5' NTR, NS3, NS4A, 
NS4B, NS5A, NS5B, and 3' NTR preferably in the order 
mentioned as well as a minimum of one marker gene for 
selection (selection gene) . 

10 

Here and in the following "NTR" stands for "non-translated 
region" and is a known and familiar term or abbreviation to 
the relevant expert, 

15 Here and in the following the term "HCV-RNA construct" 
comprises constructs, which include the complete HCV 
genome, as well as those, which only include a part of it, 
which means a HCV subgenome. 

20 A preferred variation of the cell culture system according 
to the invention, which had proven to be worthwhile in 
practice, is lodged at the DSMZ, Deutsche Sammlung von 
Mikroorganismen und Zellkulturen GmbH (German collection of 
Microorganisms and Cell Cultures) in Braunschweig, Germany 

25 under the number DSM ACC23 94 (laboratory name HuBl 9-13) . 

With the cell culture system according to the invention an 
in vitro system is provided for the first time, where HCV- 
RNA is self --replicated and expressed intracellularly and in 

30 a sufficient amount, so that the quantity of the amounts of 
HCV- RNA as well as the HCV specific proteins can be 
determined with conventional and reliably precise 
biochemical measuring methods. This means an almost 
authentic cell-based HCV replication system is available 

35 for the first time, which is urgently needed for the 
development and testing of antiviral drugs. This test 
system provides the possibility of identifying potential 
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targets for an effective HCV specific therapy and 
developing and evaluating HCV specific chemotherapeuticals . 

The invention is based on the surprising finding that 
5 efficient replication of the HCV-RNA only occurs in cells 
if they have been transfected with an HCV-RNA construct, 
which comprises at least the 5' and the 3' non-translated 
regions (NTR) and the non-structural proteins (NS) 3 to 5B 
and additionally a marker gene for selection (selection 

10 gene) . The structural genes are obviously without great 
importance for replication, whereas efficient replication 
of the HCV-RNA apparently only occurs if the transfected 
cells are subject to permanent selection pressure, which is 
imparted by the marker gene for selection (selection gene) 

15 linked to the HCV-RNA. Consequently the marker gene 
(selection gene) seems on one hand to provoke the selection 
of those cells, where the HCV-RNA replicates productively, 
and it seems on the other hand to considerably increase the 
efficiency of the RNA replication. 

20 

An object of the invention is also a cell-free HCV-RNA 
construct, characterized in that it comprises the HCV 
specific RNA segments 5' NTR, NS3, NS4A, NS4B, NS5A, NS5B, 
and 3' NTR, preferably in the order mentioned, as well as a 
25 marker gene for selection (selection gene) • 

In the present context 'the terms 5' NTR and NS3 and NS4A 
and NS4B and NS5A and NS5B and 3' NTR comprise each 
nucleotide sequence, which is described in the state of the 
30 art as the nucleotide sequence for each functional segment 
of the HCV genome. 

By providing a HCV-RNA construct such as this, a detailed 
analysis of the HCV replication, pathogenesis and evolution 
35 in cell culture is possible for the first time. The HCV 
specific viral RNA can specifically be created as a 
complete genome or subgenome in any amount, and it is 
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possible, to manipulate the "RNA construct and consequently 
to examine and identify the HCV functions on a genetic 
level . 



5 Because all HCV enzymes identified as a main target for a 
therapy at the moment, namely the NS3/4A protease, the NS3 
helicase and the NS5B polymerase, are included in the HCV- 
RNA construct according to the invention, it can be used 
for all relevant analyses - 

10 

An embodiment of the HCV-RNA construct, which has proven to 
be worthwhile in practical use, stands out by the fact that 
it comprises the nucleotide sequence according to the 
sequence protocol SEQ ID N0:1. 

15 

Further embodiments with similar good properties for 
practical use are characterized in that they comprise a 
nucleotide sequence either according to sequence protocol 
SEQ ID NO: 2 or according to sequence protocol SEQ ID NO: 3 

20 or according to sequence protocol SEQ ID NO: 4 or according 
to sequence protocol SEQ ID NO: 5 or according to sequence 
protocol SEQ ID NO: 6 or according to sequence protocol SEQ 
ID NO: 7 or according to sequence protocol SEQ ID NO: 8 or 
according to sequence protocol SEQ ID NO: 9 or according to 

25 sequence protocol SEQ ID NO: 10 or according to sequence 
protocol SEQ ID NO: 11. 

It is possible to provide the HCV subgenomic construct with 
a 3' NTR, which has a nucleotide sequence so far unknown in 
30 the state of art, a nucleotide sequence, which has been 
selected from the group of nucleotide sequences (a) to (i) 
listed in the following: 



( a ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTAGCTTT 
35 TTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGCCCAGACTGGAG 
T 
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( b ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTAGTCT 
TTTTTTTTTC TTTTTTTTGA GAGAGAGAGT CTCACTCTGT TGCCCAGACT 
GGAGC 

(c) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTAATCTTT 
TTTTTTTTCT TTTTTTTTGA GAGAGAGAGT CTCACTCTGT TGCCCAGACT 
GCAGC 

(d) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTTAGTC 
TTTTTTTTTT TCTTTTTTTT TGAGAGAGAG AGTCTCACTC TGTTGCCCAG 
ACTGGAGT 

( e ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTAGTCT 
TTTTTTTTTT TCTTTTTTTT TGAGAGAGAG AGTCTCACTC TGTTGCCCAG 
ACTGGAGT 

( f ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTAGTCT 
TTTTTTTTTT TCTTTTTTTT TTGAGAGAGA GAGTCTCACT CTGTTGCCCA 
GACTGGAGT 

(g) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTAGTCT 
TTTTTTTTTT CTTTTTTTTT GAGAGAGAGA GTCTCACTCT GTTGCCCAGA 
CTGGAGT 

(h) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTTTAAT 

(^rprprjnrpr|.rprprprp ^ rp Q Q rp rj, rj. TTTTGAGAGA GAGAGTCTCA CTCTGTTGCC 

CAGACTGGAG T 

{ i ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTTAATC 
^rprprprprprjnrprprp TTTTCTTTTT TTTTTGAGAG AGAGAGTCTC ACTCTGTTGC 
CCAGACTGGA GT 

The marker gene for selection (selection gene) included in 
the HCV-RNA constructs according to the invention is 
preferably a resistance gene, in particular an antibiotic 
resistance gene. 
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This has the advantage that the cells transfected with this 
construct can easily be selected from the non-transf ected 
cells by adding for example the appropriate antibiotic to 
the cell culture medium in the case of an antibiotic 
5 resistance gene. 

In the present context 'antibiotic' means any substance, 
which impedes the non-transf ected host cells or the cells, 
where the HCV-RNA is not replicating efficiently, 
10 continuing to live or grow, especially the cell poison 
Puromycin, Hygromycin, Zeocin, Bleomycin or Blasticidin. 

A preferred marker gene for selection (selection gene) and 
resistance gene, which has proven to be worthwhile in 
15 practice, is the neomycin phosphotransferase gene. 

An alternative for the antibiotic resistance genes is for 
example the thymidine kinase gene, which can be used to 
carry out a HAT selection. 

20 

The marker gene for selection (selection gene) , the 
preferred resistance gene and the most preferred antibiotic 
resistance gene is preferably positioned in the HCV-RNA 
construct after the HCV 5' NTR, which means downstream from 
25 the 5' NTR and upstream from the HCV reading frame. 

However, an insertion in the area of the 3' NTR or another 
site of the HCV genome or subgenome, for example within the 
polyprotein, is also conceivable. 

30 In another embodiment of the HCV-RNA construct according to 
the invention the marker gene for selection (selection 
gene) , in particular an antibiotic resistance gene, is 
linked to the HCV-RNA or HCV genomic or subgenomic sequence 
via a ribozyme or a recognition site for a ribozyme. 

35 

This has the advantage, that after the selection of the 
cells, in which the HCV-RNA is replicating productively. 
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the resistance gene in the obtained cell clones can be 
separated from the HCV subgenomic sequence through a 
ribozyme-dependent cleavage, namely by activating the 
inserted ribozyme or in the case of a construct with a 
5 recognition site for a ribozyme, by transfecting the 
ribozyme into the cells (for example through the 
transfection of a ribozyme construct or infection with a 
viral expression vector, into which the appropriate 
ribozyme has been inserted) . By this means an authentic HCV 
10 genomic construct can be obtained without a resistance 
gene, which can then form authentic infectious virus 
particles . 

Another preferred embodiment of the HCV-RNA construct 
15 according to the invention is characterized in that the 
construct has at least one integrated reporter gene. 

In the following a reporter gene means any gene, whose 
presence can be easily detected with, in general, simple 
20 biochemical or also histochemical methods after being 
transferred into a target organism, which means a gene, 
that encodes for a protein, which can be easily and 
reliably detected and quantified in small amounts with the 
common measuring methods in the laboratory. 

25 

This variation of the HCV-RNA construct has the advantage 
that the extent of the replication of this construct can be 
easily and quickly measured with the methods common in the 
laboratory using the reporter gene product. 

30 

The reporter gene is preferably a gene from the group of 
the luciferase genes, the CAT gene (chloramphenicol acetyl 
transferase gene) , the lacZ gene (beta galactosidase gene) , 
the GFP gene (green fluorescence protein gene) , the GUS 
35 gene (glucuronidase gene) or the SEAP gene (secreted 
alkaline phosphatase gene) . This reporter gene and its 
products, namely the relevant reporter proteins, can be 
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detected for example using fluorescence ^ chemiluminescence, 
colorimetric measurements or by means of immunological 
methods (for example ELISA) . 



A surrogate marker gene can also be considered as a 
reporter gene. In this context it includes those genes, 
which encode for cellular proteins, nucleic acids or 
generally for those functions, which are subject to 
variation depending on the replication of the virus, and 
which consequently are either suppressed or activated in 
the cells, in which the HCV or the HCV-RNA construct 
multiplies. This means, the suppression or activation of 
this function is a surrogate marker for the replication of 
the virus or the replication of the HCV-RNA construct. 



The positions of the reporter genes and the marker gene for 
selection (selection gene) can be selected in such a way, 
that a fusion protein made from both genetic products will 
be expressed. This has the advantage that these two genes 
can be arranged in such a way in the HCV-RNA construct that 
their two expressed proteins are fused via a recognition 
sequence for a protease (for example ubiquitin) or via a 
self-cleaving peptide (for example the 2A protein of the 
Picornaviruses) at first and will be separated 
proteolytically later . 



These two positions might as well lie apart from each other 
in such way, that both genetic products are separately 
expressed (for example in the order: marker or resistance 
gene — internal ribosome binding site — reporter gene) . 



In the case of the reporter gene one embodiment has proven 
to be particularly worthwhile, where the reporter gene is 
cloned into the open reading frame of the HCV genome or 
subgenome in such a way that it will only be transferred to 
an active form after proteolytic processing. 
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The cell culture system according to the invention can be 
used for various purposes in each of its embodiments. These 
comprise : 



5 • The detection of antiviral substances. This can include 
for example: organic compounds, which interfere directly 
or indirectly with viral growth (for example inhibitors 
of the viral proteases, the NS3 helicase, the NS5B RNA- 
dependent RNA polymerase) , antisense oligonucleotides, 

10 which will hybridize to any target sequence in the HCV- 

RNA construct (for example the 5' NTR) and will have an 
direct or indirect influence on the virus growth for 
example due to a reduction of translation of the HCV 
polyprotein or ribozymes, which cleave any HCV-RNA 

15 sequence and consequently impair virus replication. 

• The evaluation of any type of antiviral substances in 
the cell culture. These substances can be detected on 
the isolated purified enzyme for example with 'rational 
20 drug design' or 'high-throughput screening'. Evaluation 

means mainly the determination of the inhibitory 
features of the respective substance as well as its mode 
of action. 



25 • The identification of new targets of viral or cellular 
origin for a HCV specific antiviral therapy. If for 
example a cellular protein is essential for viral 
replication, the viral replication can also be 
influenced by inhibiting this cellular protein. The 

30 system according to the invention also enables the 

detection of these auxiliary factors. 



• The determination of drug resistance. It can be assumed 
that resistance to therapy occurs due to the high 
35 mutation rate of the HCV genome. This resistance, which 

is very important for the clinical approval of a 
substance, can be detected with the cell culture system 
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according to the invention. Cell lines, in which the 
HCV-RNA construct or the HCV genome or subgenome 
replicates, are incubated with increasing concentrations 
of the relevant substance and the replication of the 
5 viral RNA is either determined by means of an introduced 

reporter gene or through the qualitative or quantitative 
detection of the viral nucleic acids or proteins. 
Resistance is given if no or a reduced inhibition of the 
replication can be observed with the normal 

10 concentration of the active substance. The nucleotide 

and amino acid replacements responsible for the therapy 
resistance can be determined by recloning the HCV-RNA 
(for example by the means of RT-PCR) and sequence 
analysis. By cloning the relevant replacement /s into the 

15 original construct its causality for the resistance to 

therapy can be proven. 

• The production of authentic virus proteins (antigens) 
for the development and/or evaluation of diagnostics. 
20 The cell culture system according to the invention also 

allows the expression of HCV antigens in cell cultures. 
In principle these antigens can be used as the basis for 
diagnostic detection methods. 

The production of HCV viruses and virus-like particles, 
in particular for the development or production of 
therapeutics and vaccines as well as for diagnostic 
purposes. Especially cell culture adapted complete HCV 
genomes, which could be produced by using the cell 
cultur system according to the invention, are able to 
replicate in cell culture with high efficiency. These 
genomes have the complete functions of HCV and in 
consequence they are able to produce infectious viruses. 

35 The HCV-RNA construct according to the invention by itself 
can also be used for various purposes in all its 
embodiments. This includes first of all: 



25 • 



30 
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The construction of attenuated hepatitis C viruses or 
HCV-like particles and their production in cell 
cultures : 

Attenuated HCV or HCV-like particles can be created by 
accidental or purposefully introduced mutations, such as 
point mutations, deletions or insertions, which means 
viruses or virus-like particles with complete ability to 
replicate, but reduced or missing pathogenicity. These 
attenuated HCV or HCV-like particles can be used in 
particular as vaccine. 

The construction of HCV-RNA constructs with integrated 
foreign genes, used for example as liver cell specific 
vector in gene therapy. Due to the distinctive liver 
cell tropism of the HCV and the possibility of replacing 
parts of the genome by heterologous sequences, HCV-RNA 
constructs can be produced, where for example the 
structural proteins can be replaced by a therapeutically 
effective gene. The HCV-RNA construct obtained in this 
way is introduced into cells preferably by means of 
transf ection, which express the missing HCV functions, 
for example the structural proteins, in a constitutive 
or inducible way. Virus particles, carrying the HCV-RNA 
construct, can be created by means of this method known 
to the expert under the term ' transcomplementat ion ' . The 
particles obtained can preferably be used for the 
infection of liver cells. Within these the 
therapeutically effective foreign gene will be expressed 
and will consequently develop its therapeutic effect. 

The detection of permissive cells, which means cells, in 
which a productive virus growth occurs. For this purpose 
either one of the HCV-RNA genomic constructs previously 
mentioned, which is able to form complete infectious 
viruses, or one of the HCV subgenomes previously 
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mentioned, which according to the previously mentioned 
example will be transfected in a cell line first, which 
expresses the missing functions in a constitutive or 
inducible way, is used. In each case virus particles are 
5 created, which carry a resistance and/or reporter gene 

apart from the HCV sequence. In order to detect cells, 
where the HCV is able to replicate, these cells are 
infected with viruses generated in this way and subject 
to an antibiotic selection or they are examined 

10 depending on the HCV-RNA construct by means of 

determining the presence of the expression of the 
reporter gene. Because an antibiotic resistance or 
reporter gene expression can only be established, when 
HCV-RNA construct replicates, the cells detected in this 

15 way must be permissive. Almost any cell line or primary 

cell culture can be tested in regard to the permissivity 
and detected in this way. 



20 The cell culture system according to the invention also 
permits targeted discovery of HCV-RNA constructs for which 
there is an increase in the efficiency of replication due 
to mutations- This occurs either by chance, in the context 
of HCV-RNA replication, or by targeted introduction into 

25 the construct. These mutations, leading to a change in the 
replication of the HCV-RNA construct, are known to experts 
as adaptive mutations.' The invention therefore also 
includes a method for obtaining cell culture adapted 
mutants of a HCV-RNA construct according to the invention 

30 following the above description, in which the mutants have 
increased replication efficiency compared to the original 
HCV-RNA construct. It further includes a method for the 
production of mutants of a HCV-RNA full-length genome or of 
a HCV-RNA subgenome or of any HCV-RNA construct with 

35 increased replication efficiency compared to the original 
HCV-RNA full-length genome or subgenome or HCV-RNA 
construct, as well as cell culture adapted mutants of HCV- 
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RNA constructs, HCV-RNA " full-length genomes and HCV 
subgenomes with increased replication efficiency compared 
to the original constructs, subgenomes or full-length 
genomes . 

5 

The method according to the invention for the production of 
cell culture adapted mutants of a HCV-RNA construct 
according to the invention, in which the mutants have 
increased replication efficiency compared to the HCV-RNA 

10 construct, is characterised in that a cell culture system 
according to claim 1, in which the transfected HCV specific 
genetic material is a HCV-RNA construct with a selection 
gene according to claims 4 to 19, is cultivated on/in the 
selection medium corresponding to the selection gene, that 

15 the cultivated cell clones are collected and that the HCV- 
RNA construct is isolated from these cell clones. 

In an advantageous extension of this production method, the 
isolated HCV-RNA constructs are passaged at least one more 

20 time, that is they are transfected in cells of a cell 
culture system according to claim 1, the thus obtained cell 
culture system according to claim 1, in which the 
transfected HCV specific genetic material is the isolated 
HCV-RNA construct with a selection gene, is cultivated 

25 on/in the selection medium corresponding to the selection 
gene, the cultivated cell clones are collected and the HCV- 
RNA constructs are thus isolated. 

Using this process variation, the quantity of adaptive 
30 mutations and hence the degree of replication efficiency in 
the relevant HCV-RNA constructs can be increased even 
further . 

The method according to the invention for the production of 
35 mutants of a HCV-RNA full-length genome or of a HCV-RNA 
subgenome or of any HCV-RNA construct with increased 
replication efficiency compared to the original HCV-RNA 
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full-length genome or subgenome or HCV-RNA construct, has 
the following features. Using one of the two production 
methods presented above, a cell culture adapted mutant of a 
HCV-RNA construct is produced, isolated from the cells, 
5 cloned using state of the art known methods and sequenced. 
By comparing with the nucleotide and amino acid sequence of 
the original HCV-RNA construct, the type, number and 
position of the mutations is determined. These mutations 
are then introduced into an (isolated) HCV subgenome or 
10 full-length genome or any HCV-RNA construct, either by 
site-directed mutagenesis, or by exchange of DNA fragments 
containing the relevant mutations. 

A test can be carried out to determine or verify which 
15 mutations actually are responsible for an alteration of 
replication efficiency, particularly an increase in 
replication. In this test the corresponding nucleotide 
and/or amino acid changes are introduced into the original 
HCV-RNA construct and the modified construct is then 
20 transfected in cell culture. If the introduced mutation 
actually leads to an increase in replication, then for a 
HCV-RNA construct with a selectable marker gene, the number 
of resistant cell clones in the artificially mutated 
construct should be noticeably higher compared to the 
25 untreated construct. 

In the case of a construct with a reporter gene, the 
activity or quantity of the reporter should be noticeably 
higher for the artificially mutated construct compared to 
30 the untreated one. 

The cell culture adapted HCV-RNA constructs with high 
replication efficiency according to the invention are 
characterized in that, through nucleotide or amino acid 
35 exchanges, they are derivable from a HCV-RNA construct 
according to one of the claims 4 to 19, and that they are 
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obtainable using one of the two production processes 
presented above. 

These cell culture adapted HCV-RNA constructs can be used 
to produce any HCV-RNA constructs or HCV full-length or 
subgenomes with increased replication efficiency. Both 
constructs with a selectable resistance gene and constructs 
without one or with a non-selectable reporter gene (e.g. 
luciferase) can be produced in this way, since replication 
of cell culture adapted HCV-RNA constructs can also be 
demonstrated in non-selected cells due to their high 
replication efficiency. 

The cell culture adapted mutants of a HCV-RNA construct or 
HCV-RNA full-length genome or HCV subgenome with high 
replication efficiency compared to the original HCV-RNA 
construct or the original HCV full-length genome are 
characterized in that they are obtainable by a method in 
which the type and number of mutations in a cell culture 
adapted HCV-RNA construct are determined through sequence 
analysis and sequence comparison and these mutations are 
introduced into a HCV-RNA construct, particularly a HCV-RNA 
construct according to one of the claims 4 to 19, or into 
an (isolated) HCV-RNA full-length genome, either by site- 
directed mutagenesis, or by exchange of DNA fragments 
containing the relevant mutations. 

A group of preferred HCV-RNA constructs, HCV full-length 
genomes and HCV subgenomes with high and very high 
replication efficiency, which are consequently highly 
suitable for practical use is characterised in that it 
contains one, several or all of the amino acid or nucleic 
acid exchanges listed in table 3 and/or one or several of 
the following amino acid exchanges: 1283 arg -> gly , 1383 
glu -> ala , 1577 lys -> arg , 1609 lys -> glu , 1936 
pro -> ser , 2163 glu -> gly , 2330 lys -> glu , 2442 ile 
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-> val. (The numbers refer to the amino acid positions of 
the polyprotein of the HCV isolate conl, see Table 1). 

5 Special features of the nucleotid sequences according to the sequence listings: 

SEQ ID -NO: 1 

« 

Name : I38 9/Core-3 ' /wt 

Composition (Nucleotide positions) : 
1.1-341: HCV 5' non-translated region 
10 2 •342-1193: HCV Core Protein-Neomycin Phosphotransferase 
fusion protein; selectable Marker 
3.1202-1812: internal ribosome entry site from 

encephalomyokarditis virus; directs translation of the 
downstream located HCV open reading frame 
15 4-1813-10842: HCV Polyprotein from Core up to nonstructural 
protein 5B 

5. 1813-2385: HCV Core Protein; structural protein 

6. 2386-2961: envelope protein 1 (El); structural 
protein 

20 7. 2962-4050: envelope protein 2 {E2) ; structural 

protein 

8. 4051-4239: Protein p7 

9. 4240-4890: nonstructural protein 2 (NS2) ; HCV NS2-3 
Protease 

25 10.4891-6783: nonstructural protein 3 (NS3) ; HCV NS3 

Protease/Helicase 
11.6784-6945: nonstructural protein 4A (NS4A) ; NS3 

Protease cofactor 
12.6946-7728: nonstructural protein 4B (NS4B) 
30 13.7729-9069: nonstructural protein 5A (NS5A) 

14.9070-10842: nonstructural protein 58 (NS5B) ; RNA- 

dependent RNA-polymerase 
15.10846-11076: HCV 3' non-translated region 
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SEQ ID -NO: 2 

Name: I3 37/NS2-3 ' /wt 

Composition (Nucleotide positions) : 
1.1-341: HCV 5' non- t rans lat ed region 
5 2.342-1181: HCV Core Prot ein-Neomycin Phosphotransferase 
fusion protein; selectable Marker 
3-1190-1800: internal ribosome entry site from 

encephalomyokarditis virus; directs translation of the 
downstream located HCV open reading frame 
10 4.1801-8403: HCV Polyprotein from nonstructural protein 2 
up to nonstructural protein 5B 

5. 1801-2451: nonstructural protein 2 (NS2); HCV NS2-3 
Protease 

6. 2452-4344: nonstructural protein 3 (NS3) ; HCV NS3 
15 Protease /He licase 

7. 4345-4506: nonstructural protein 4A (NS4A) ; NS3 
Protease cofactor 

8. 4507-5289: nonstructural protein 4B (NS4B) 

9. 5290-6630: nonstructural protein 5A (NS5A) 

20 10,6631-8403: nonstructural protein 5B (NS5B) ; RNA- 

dependent RNA-polymerase 
11.8407-8637: HCV 3* non-translated region 



SEQ ID-NO: 3 

25 Name: 138 9/NS3-3 ' /wt 

Composition (Nucleotide positions) : 
1.1-341: HCV 5' non-translated region 

2.342-1193: HCV Core Protein-Neomycin Phosphotransferase 

fusion protein; selectable Marker 
30 3.1202-1812: internal ribosome entry site from 

encephalomyokarditis virus; directs translation of the 

downstream located HCV open reading frame 
4.1813-7767: HCV Polyprotein from nonstructural protein 3 

up to nonstructural protein 5B 



CA 02303526 2000-03-31 



- 23 - 

5. 1813-3708: nonstructural protein 3 (NS3); HCV NS3 
Protease /He 1 lease 

6. 3709-3870: Nonstructural protein 4A {NS4A) ; NS3 
Protease Cofactor 

5 7. 3871-4653: Nonstructural protein 4B (NS4B) 

8. 4654-5994: Nonstructural protein 5A (NS5A) 
9- 5995-7767: Nonstructural protein 5B (NS5B) ; RNA- 

dependent RNA- Polymerase 
10. 7771-8001: HCV 3' non-translated Region 

10 

SEQ ID -NO: 4 

Name: 1337 /NS 3 -3 ' /wt 

Composition (Nucleotide positions) : 
1.1-341: HCV 5' non-translated region 
15 2-342-1181: HCV Core Protein-Neomycin Phosphotransferase 
fusion protein; selectable Marker 
3.1190-1800: internal ribosome entry site from 

encephalomyokarditis virus; directs translation of the 
downstream located HCV open reading frame 
20 4.1801-7758: HCV Polyprotein from Nonstructural protein 3 
up to Nonstructural protein 5B 

5. 1801-3696: Nonstructural protein 3 (NS3) ; HCV NS3 
Protease /Hel lease 

6. 3697-3858: Nonstructural protein 4A {NS4A) ; NS3 
25 Protease Cofactor - 

7. 3859-4641: Nonstructural protein 43 (NS4B) 

8. 4642-5982: Nonstructural protein 5A (NS5A) 

9. 5983-7755: Nonstructural protein 5B (NS5B) ; RNA- 
dependent RNA- Polymerase 

30 10. 7759-7989: HCV 3' non-translated Region 



SEQ ID-NO: 5 

Name: I38 9/NS2-3 ' /wt 

Composition (Nucleotide positions) : 
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1.1-341: HCV 5* non-translated region 

2,342-1193: HCV Core Protein-Neomycin Phosphotransferase 

fusion protein; selectable Marker 
3.1202-1812: internal ribosome entry site from 
5 encephalomyokarditis virus; directs translation of the 

downstream located HCV open reading frame 
4.1813-8418: HCV Polyprotein from Nonstructural protein 2 

up to Nonstructural protein 5B 

5. 1813-2463: Nonstructural protein 2 {NS2); HCV NS2-3 
10 Protease 

6. 2464--4356: Nonstructural protein 3 (NS3); HCV NS3 
Protease/Helicase 

7. 4357-4518: Nonstructural protein 4A {NS4A) ; NS3 
Protease Cofactor 

15 8. 4519-5301: Nonstructural protein 4B (NS4B) 

9. 5302-6642: Nonstructural protein 5A (NS5A) 
10.6643-8415: Nonstructural protein 53 (NS5B) ; RNA- 

dependent RNA- Polymerase 
11. 8419-8649: HCV 3' non-translated Region 

20 

SEQ ID-NO: 6 

Name: I38 9/NS3-3 ' /9-13F 
Composition (Nucleotide positions) : 
1.1-341: HCV 5' non-translated region 
25 2.342-1193: HCV Core Protein-Neomycin Phosphotransferase 
fusion protein; selectable Marker 
3.1202-1812: internal ribosome entry site from 

encephalomyokarditis virus; directs translation of the 
downstream located HCV open reading frame 
30 4.1813-7767: HCV Polyprotein from Nonstructural protein 3 
up to Nonstructural protein 5B of the cell culture- 
adapted mutant 9-13F 
5. 1813-3708: Nonstructural protein 3 (NS3) ; HCV NS3 
Protease/Helicase 
35 6. 3709-3870: Nonstructural protein 4A (NS4A) ; NS3 
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Protease Cofactor 



7 . 



3871-4653: Nonstructural protein 4B (NS4B) 

4654-5994: Nonstructural protein 5A (NS5A) 

5995-7767: Nonstructural protein 5B (NS5B) ; RNA- 
dependent RNA-Polyir^erase 



8 



9. 



5 



7771-8001: HCV 3* non- translated Region 
SEQ ID-NO: 7 

Name: I38 9/Core-3 V9-13F 
10 Composition (Nucleotide positions) : 

1-1-341: HCV 5' non-translated region 

2.342-1193: HCV Core Prot ein-Neomycin Phosphotransferase 

fusion protein; selectable Marker 
3.1202-1812: internal rifcosome entry site from 
15 encephalomyokarditis virus; directs translation of the 

downstream located HCV open reading frame 
4.1813-10842: HCV Polyprotein from Core up to Nonstructural 
protein 5B of the cell culture-adapted mutant 9-13F 
5- 1813-2385: HCV Core Protein; structural protein 
20 6. 2386-2961: envelope protein 1 (El); structural 



protein 

7, 2962-4050: envelope protein 2 (E2); structural 



25 



protein 

8. 4051-4239: Protein p7 

9. 4240-4890: Nonstructural protein 2 (NS2) ; HCV NS2-3 



Protease 



10.4891-6783: Nonstructural protein 3 (NS3) ; HCV NS3 

Protease /Helicase 
11.6784-6945: Nonstructural protein 4A (NS4A) ; NS3 



30 



Protease Cofactor 



12.6946-7728: Nonstructural protein 4B (NS4B) 
13.7729-9069: Nonstructural protein 5A (NS5A) 



14.9070-10842: Nonstructural protein 5B (NS5B) ; RNA- 



dependent RNA- Polymerase 



35 



15.10846-11076: HCV 3' non-translated Region 
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SEQ ID -NO: 8 

Name : I38 9/NS3-3 V5 . 1 
Composition (Nucleotide positions) : 
1.1-341: HCV 5' non-translated region 
5 2.342-1193: HCV Core Protein-Neomycin Phosphotransferase 
fusion protein; selectable Marker 
3.1202-1812: internal ribosome entry site from 

encephalomyokarditis virus; directs translation of the 
downstream located HCV open reading frame 
10 4.1813-7767: HCV Polyprotein from Nonstructural protein 3 
up to Nonstructural protein 5B of the cell culture- 
adapted mutant 5.1 
5. 1813-3708: Nonstructural protein 3 (NS3); HCV NS3 
Protease/Helicase 
15 6. 3709-3870: Nonstructural protein 4A (NS4A) ; NS3 

Protease Cofactor 

7. 3871-4653: Nonstructural protein 4B (NS4B) 

8. 4654-5994: Nonstructural protein 5A (NS5A) 

9- 5995-7767: Nonstructural protein 53 (NS5B) ; RNA- 
20 dependent RNA-Polymerase 

7771-8001: HCV 3' non-translated Region 

SEQ ID -NO: 9 

Name : I38 9/Core-3 ' /5 . 1 
25 Composition (Nucleotide positions) : 

1.1-341: HCV 5' non-translated region 

2.342-1193: HCV Core Protein-Neomycin Phosphotransferase 

fusion protein; selectable Marker 
3.1202-1812: internal ribosome entry site from 
30 encephalomyokarditis virus; directs translation of the 
downstream located HCV open reading frame 
4.1813-10842: HCV Polyprotein from Core up to Nonstructural 
protein 5B of the cell culture-adapted mutant 5.1 
5. 1813-2385: HCV Core Protein; structural protein 
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6. 2386-2961: envelope protein 1 (El); structural 

protein 

7. 2962-4050: envelope protein 2 (E2); structural 
protein 

5 8. 4051-4239: Protein p7 

9. 4240-4890: Nonstructural protein 2 (NS2); HCV NS2-3 

Protease 

10.4891-6783: Nonstructural protein 3 (NS3); HCV NS3 
Pro tease/ He licase 
10 11.6784-6945: Nonstructural protein 4A (NS4A) ; NS3 

Protease Cofactor 
12.6946-7728: Nonstructural protein 4B (NS4B) 
13.7729-9069: Nonstructural protein 5A (NS5A) 
14.9070-10842: Nonstructural protein 5B (NS5B) ; RNA- 
15 dependent RNA-Polymerase 

15.10846-11076: HCV 3' non-translated Region 

SEQ ID -NO: 10 

Name: I38 9/NS3-3' /I 9 
20 Composition (Nucleotide positions) : 

1.1-341: HCV 5' non-translated region 

2.342-1193: HCV Core Protein-Neomycin Phosphotransferase 

fusion protein; selectable Marker 
3.1202-1812: internal ribosome entry site from 
25 encephalomyokarditis virus; directs translation of the 

downstream located HCV open reading frame 
4.1813-7767: HCV Polyprotein from Nonstructural protein 3 
up to Nonstructural protein 58 of the cell culture- 
adapted mutant 19 
30 5. 1813-3708: Nonstructural protein 3 (NS3) ; HCV NS3 

Protease /Helicase 

6. 3709-3870: Nonstructural protein 4A (NS4A) ; NS3 
Protease Cofactor 

7. 3871-4653: Nonstructural protein 4B (NS4B) 
35 8. 4654-5994: Nonstructural protein 5A (NS5A) 
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9. 5995-7767: Nonstructural protein SB (NSSB) ; RNA- 
dependent RNA- Polymerase 
7771-8001: HCV 3' non-translated Region 

5 SEQ ID -NO: 11 

Name: 138 9 /Core- 3 VI 9 

Composition (Nucleotide positions) : 

1.1-341: HCV 5' non-translated region 

2.342-1193: HCV Core Protein-Neomycin Phosphotransferase 
10 fusion protein; selectable Marker 

3,1202-1812: internal ribosome entry site from 

encephalomyokarditis virus; directs translation of the 
downstream located HCV open reading frame 
4.1813-10842: HCV Polyprotein from Core up to Nonstructural 
15 protein 58 of the cell culture-adpated mutant 19 

5. 1813-2385: HCV Core Protein; structural protein 

6. 2386-2961: envelope protein 1 (El); structural 
protein 

7. 2962-4050: envelope protein 2 (E2); structural 
20 protein 

8. 4051-4239: Protein p7 

9. 4240-4890: Nonstructural protein 2 (NS2); HCV NS2-3 
Protease 

10,4891-6783: Nonstructural protein 3 (NS3) ; HCV NS3 
25 Protease/Helicase - 

11.6784-6945: Nonstructural protein 4A (NS4A) ; NS3 

Protease Cofactor 
12.6946-7728: Nonstructural protein 4B (NS4B) 
13.7729-9069: Nonstructural protein 5A (NS5A) 
30 14.9070-10842: Nonstructural protein 5B (NSSB); RNA- 

dependent RNA- Polymerase 
15.10846-11076: HCV 3' non-translated Region 
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The invention is described in detail in the following by 
way of examples of embodiments and respective diagrams. 
These are as follows 

5 Fig. 1 A: The structure of a HCV-RNA construct according to 
the invention 

On the top a diagram of the structure of the 
complete parental HCV genome is given with the 

10 positions of the genes for the cleavage products 

core. El, E2, p7, NS2, NS3, NS4A, NS4B, NS5A, and 
NS5B within the polyprotein, and the 5' and 3' 
non-translated regions (5* NTR and 3' NTR) 
shown as a thick horizontal line and with the 

15 two positions selected for the creation of the 

subgenomic constructs, namely the position of the 
'GDD catalytic domain' of the NS5B RNA polymerase 
(GDD) and the position of the 3' end of the HCV- 
IRES (nucleotide positions 1 to 377 and 1 to 389) 

20 - drawn above the diagram of the genome. The 

numbers below the diagram of the genome indicate 
the respective nucleotide positions. 

Diagrams of the structure of two modified HCV-RNA 
25 constructs (subgenome) according to the invention 

are shown below, consisting of the 5' HCV-IRES, 
the neomycin phosphotransferase gene (Neo^) , the 
EMCV-IRES (E-I) and the HCV sequences of NS2 or 
NS3 up to the authentic 3* end. The position of 
30 the 10-amino acid deletion comprising the NS5B 

polymerase GDD motive is marked with a triangle { 
A) . 

Fig. 1 B: The result of a denaturing f ormaldehyde-agarose 
35 gel electrophoresis for the detection of 

replicating plus-strand RNA in transfected 
subpassaged Huh 7 cell clones. 
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The positions of HCV specific RNAs (arrows) and 
the 28S rRNA are specified to the right of lane 
12, the size (number of nucleotides) of the RNA 
5 marker (M) is specified to the left of lane 1. 

Fig. 1 C: The result of a PGR test with subsequent Southern 
Blot to demonstrate the absence of integrated 
replicon DNA in most of the selected cell clones. 

10 

The lanes 1 and 2 show the positive control, lane 
13 the negative control. The figures to the left 
of lane 1 indicate the size of the nucleotide 
marker molecules. 

15 

Fig. 2 A: The result of a PGR test with subsequent Southern 
Blot for the detection of the sensitive exclusion 
of integrated replicon DNA (plasmid molecule 
I377/NS3-3 * /wt ) in a cell clone containing a HCV- 
20 RNA construct (9-13) . 



The lanes 7 to 11 represent the result of a 
titration of DNA molecules of the construct 
l377/NS3-'3 ' /wt without addition of total DNA of 

25 the cell clone 9-13, and the lanes 2-6 

represent the result obtained with the same 
plasmid molecules with the addition of 1 yg 9-13 
DNA each prior to the PGR (for the purpose of 
excluding an inhibitor of the PGR in the DNA 

30 preparation) . Lane 13 represents the negative 

control (PGR without DNA template) . Lane 1 shows 
the result, which was achieved with one pg total 
DNA of the cell clone 9-13. 



35 Fig. 2 B: The result of a Northern Blot test to quantify 
the amounts of HCV plus- and minus-strand RNA. 
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The arrows mark the positions of replicon RNA. 

The "plus" and "minus" details indicate the 
positive (plus) and negative (minus) polarity of 
5 the RNA controls, which have been applied to the 

gel. "Minus-strand" and "Plus-strand" indicate 
the specificity of the radioactive RNA probes. 



Fig. 2 C: The result of a f ormaldehyde-agarose gel 
10 electrophoresis after radioactive labeling of the 

intracellular replicated HCV-RNA to demonstrate 
the resistance of HCV-RNA replication to 
dactinomycin . 



15 Fig. 3 A: The detection of HCV specific antigens in the 
selected cell clones by means of 

immunoprecipitation after metabolic radioactive 
labeling . 



20 The lanes 7-9 represent authentic size marker 

(which have been obtained in Huh 7 cells after 
the transient expression of a HCV-RNA construct) ; 
identified HCV proteins are labeled on the left 
edge of lane 1, the molecular weights (in 

25 Kilodalton) are specified to the right of lane 9. 

Fig. 3 B: Results of an immunofluorescence test to 
establish the subcellular localisation of HCV 
antigens . 

30 

Fig. 4 : Diagram of the structure of a selectable HCV-RNA 
construct according to the invention (complete 
genome) consisting of the 5' HCV-IRES, the 
neomycin phosphotransferase gene (NeoR) , a 
35 heterologous IRES element, for example from the 

encephalomyocarditis virus (E-I), the complete 
HCV open reading frame and the authentic 3' NTR. 
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Diagram of the structure of HCV-RNA constructs 
with inserted antibiotic resistance gene (A) 
within the nucleotide sequence encoding the 
poiyprotein (monocistronic RNA construct) , and 
(B) within the 3' NTR (bicistronic RNA 
construct) • 

Diagram of the structure of HCV-RNA constructs 
with inserted reporter genes (A) as part of a HCV 
replicon from NS3 to NS5B, - in the end the 
reporter protein is cleaved by viral or cellular 
proteases out of the poiyprotein or the marker 
gene for selection (selection gene) or the 
resistance gene are transfected into the cells 
through co-transf ect ion (B) as part of a 

fusion gene composed of a resistance and reporter 
gene (for example for the neomycin 
phosphotransferase and green fluorescent 
protein) , (C) as part of a replicon composed of a 
resistance and reporter gene (for example the 
neomycin phosphotransferase and green fluorescent 
protein) bound via a nucleotide sequence, which 
encodes for an amino acid sequence (area with 
hatches), which can be cleaved by a protease or 
has a self -cleaving (autocatalytic) activity, (D) 
as independent gene (in this case the green 
fluorescent protein) , which is expressed from its 
own internal ribosome binding site (IRES); — the 
resistance gene (in this case the neomycin 
phosphotransferase gene) is also expressed from 
its own internal ribosome binding site (IRES) 
(polycistronic construct) . 

Diagram of the structure of a HCV-RNA construct, 
where the resistance gene is linked to the HCV- 
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RNA sequence via a ribozyme or a recognition site 
for a ribozyme. 

The thick lines illustrate the HCV 5* and 3' 
NTRs, E-I is a heterologous internal ribosome 
binding site, which is required for the 
expression of the resistance gene, and the grey' 
square illustrates the ribozyme or a recognition 
site for a ribozyme. 

Fig. 8: Diagram of the structure of a HCV-RNA construct 
with resistance gene and integrated foreign gene. 

Fig. 9: Method for comparing the specific infectivity 

15 (expressed as number of cell colonies obtained) 

of total RNA against in vitro transcripts. HCV- 
RNA is generated by in vitro transcription of a 
corresponding RNA construct and quantified by 
measurement of the optical density at 2 60 nm (OD 

20 260 nm) . A defined number of these molecules is 

mixed with a specified amount of total RNA from 
naive Huh-7 cells and this mixture is transfected 
into naive Huh-7 cells with the aid of 
electroporation. At the same time the total RNA 

25 of a cell clone, produced by the method described 

in Figure 1, is isolated using a known state of 
the art method and the amount of HCV-RNA 
contained therein is determined by means of 
Northern Blot using a HCV specific RNA probe and 

30 subsequent quantification via phosphoimaging . A 

defined amount of this total RNA is analogously 
transfected in naive Huh-7 cells. These cells 
from both the cultures are then subjected to a 
G418 selection and the number of colonies created 

35 is determined by counting after fixing and 

staining with Coomassie Brilliant Blue, For the 
determination of transfection efficiency l\ig of a 
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plasmid allowing the expression of luciferase is 
added to each transfection culture. An aliquot of 
the transfected cells is collected after 24 hours 
and the luciferase activity determined in the 
5 respective cell lysates. The number of colonies 

is always normed to the luciferase expression. 

Fig. 10: Sequence analysis of the 9-13 clone. Total RNA of 
the 9-13 cell clone, resulting from transfection 

10 of HCV-RNA construct l377/NS3-3\ was isolated 

using a known state of the art method and the 
HCV-RNA construct amplified from nucleotide 
position 59 to 9386 with the aid of 'long- 
distance RT-PCR* using primer S59 and A9413. The 

15 PGR fragments were cloned and 11 clones (9-13 A - 

K) completely sequenced. Clones D and I, E and G 
as well as H and J turned out to be identical, 
respectively. The positions of the amino acid 
differences in the NS3-5B region between the 

20 recloned HCV-RNAs and the parental construct are 

marked with a thick vertical mark for each clone. 
Each clone was digested with restriction enzyme 
Sfil and the respective fragment inserted in the 
parental construct. These clones were each 

25 transfected in Huh-7 cells and the cells 

subjected to selection as described in Figure 1. 
The number of" cell clones obtained with each 
construct is noted next to the respective 
construct on the right. 

30 

Fig. 11 A: Principle of determination of replication with 
the aid of a reporter gene. In the upper part of 
the figure, the HCV-DNA construct l389/Luc/NS3-3 • 
is shown. It consists of the HCV 5' NTR 
35 (nucleotide positions 1-389) , the luciferase gene 

(iuc), the IRES of the encephalomyocarditis 
virus, the HCV NS3-5B and the 3' NTR. The 
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position of the active centre of the NS5B RNA 
polymerase, into which a deactivating amino acid 
exchange was introduced, is indicated by 'GND'. 
The plasmids, which code for the HCV-RNA 
5 construct which is able to replicate or is 

defective, are digested with the restriction 
enzyme Seal and added to an in vitro 
transcription reaction with T7 RNA polymerase. 
After removal of the template DNA, the respective 
10 HCV-RNA constructs were transfected in naive Huh- 

7 cells by means of electroporation and the 
latter collected at regular intervals. 

Fig. 11 B: Comparison of lucif erase activity in cells 
15 transfected with the parental HCV-RNA construct 

l389/Luc/NS3-3 Vwt (wt) or the following variants: 
inactive RNA (318 DN) , variants 9-13F or variant 
5.1. The cells were collected 6 (not shown), 24, 
48, 72, 96, 120, 144 and 168 hours after 
20 transfection and luciferase activities determined 

by luminometric measurement - 

Fig. 12: Selectable HCV full-length genomes (constructs 
l389/core 3V5.1 and lasg/core 3V9-13F}. 
25 (A) Diagram of the full-length construct. The 

region between both indicated recognition sites 
for the restriction enzyme Sfil corresponds to 
the sequences of the highly adapted RNA variants 
5.1. or 9-13F. 

30 (B) Number of colonies which were obtained after 

transfection of 0.1 \ig in vitro transcribed RNA 
of the construct l389/core-3 V5 . 1 . described under 
A into HUH7-cells. The result of a representative 
experiment is given. 

35 (C) Demonstration of autonomously replicating HCV 

full-length RNAs in G418 resistant cell clones 
which were obtained after transfection of the 
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corresponding in vitro transcript. The 

illustration shows the autoradiogram of a 
Northern Blot, which was hybridised with a probe 
against the neo-resistance gene and the HCV 5' 
NTR. The controls shown in lanes 1 and 2 each 
correspond to 10^ molecules of the indicated in 
vitro transcripts, mixed with total RNA from 
naive Huh-7 cells. The negative control contains 
only total RNA from naive Huh-7 cells (lane 3) . 
Lanes 4-9 contain 3-10 pg total RNA from G418 
resistant cell clones, which were obtained after 
transfection by in vitro transcribed 1389/core 
3V5.1 RNA or 1389/core 3V9~13F RNA. The G418 
concentration used for the selection is indicated 
in each case. Five of the cell clones shown 
contain the highly adapted RNA variant 5.1 (lanes 
4-8), one contains the adapted RNA variant 9-13F 
(lane 9) - 

HCV-RNA constructs with a reporter gene. (A) 
Bicistronic HCV-RNA constructs. The reporter gene 
is translated with the aid of a separate IRES. 
(B) Monocistronic HCV-RNA constructs. The 
reporter gene product is expressed as fusion 
protein with a HCV protein. Both portions are 
linked via a recognition sequence for a viral or 
cellular protease, which permits a proteolytic 
separation of the two fused protein portions. In 
the example shown the reporter gene product and 
respective HCV protein was fused through a 
recognition sequence for ubiquitin (Ub) . 

Tricistronic full-length HCV-RNA construct, that 
in addition to the resistance gene possesses an 
inserted foreign gene (ther. gene) • 
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Fig- 15: Monocistronic HCV-RNA constructs^ for which the 
resistance gene product is expressed as a fusion 
protein with HCV portion. The resistance gene 
(RG) is either active as a fusion protein or it 

5 is fused with the HCV portion via a 

proteolytically cleavable sequence in such a way 
that the resistance gene product is split from 
the HCV portion by a cellular or viral protease - 
In the example shown the resistance gene was 

10 fused with the respective HCV portion through the 

sequence coding for ubiquitin (Ub) . 



15 Example 1 : Production of HCV-RNA constructs 

(A) Synthesis and cloning of a complete HCV consensus 
genome by means of RT-PCR 

20 The HCV genome, which means the HCV-RNA, was isolated from 
the liver of a chronically infected patient as described in 
the following: 

The total RNA was isolated from approx. 100 mg of liver 
25 according to the method described by Chomczynski and Sacci 
(1987, Anal. Biochem. 162, 156). Using 1 \ig of this 
isolated RNA a reverse transcription was carried out with 
the primers A6103 (GCTATCAGCCGGTTCATCCACTGC) or A9413 
(CAGGATGGCCTATTGG CCTGGAG) and the expand reverse 
30 transcriptase system (Boehringer Mannheim, Germany) 
according to the manufacturer's recommendations. A 
polymerase chain reaction (PCR) was carried out with the 
products of this reverse transcription using the expand 
long template system (Boehringer Mannheim, Germany) , in 
35 which the buffer containing 2 % of dimethyl sulfoxide was 
used. After one hour at 42 °C, 1/8 of the reaction mixture 
was used for a first PCR with primers A6103 and S59 
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(TGTCTTCACGCAGAAAGCGTCTAG) or A9413 and S4 5 42 (GATGAGCT 
CGCCGCGAAGCTGTCC) . After 40 cycles, 1/10 of this reaction 
mixture was used for a second PGR with primers S59 and 
A4919 (AGCACAGCCCGCGTCATAGCACTCG) or S4542 and A9386 
5 (TTAGCTCCCCG TTCATCGGTTGG . After 30 cycles the PGR products 
were purified by means of preparative agarose gel 
electrophoresis and the eluted fragments were ligated into 
the vector pCR2 , 1 (Invitrogen) or pBSK II (Stratagene) . 
Four clones of each fragment were analyzed and sequenced, 

10 and a consensus sequence was established. For this purpose 
the DNA sequences were compared to each other. The 
positions, where the sequence of one fragment was different 
from the others, were considered as undesired mutations. In 
the case of ambiguities of the sequence, shorter 

15 overlapping PGR fragments of the respective region were 
amplified and several clones sequenced. By this means 
several potential mutations could be identified in each 
fragment and consequently an isolate specific consensus 
sequence could be established. This established consensus 

20 sequence or genome belongs to the worldwide spread genotype 
lb. The non-translated region at the 3' end ( = 3' NTR) was 
obtained by means of a conventional PGR, whereby an 
antisense primer, which covers the last 24 nucleotides of 
the 'X-tail' known from the state of the art was used 

25 (Tanaka et al., 1995, Biochem. Biophys . Res. Gommun. 215, 
744; und Rice, PGT/US 96/14033) . The authentic non- 
translated region on the 5* end (=5' NTR) downstream of the 
T7 promoter was created by means of PGR, whereby an 
oligonucleotide, which corresponds to a shortened T7 

30 promoter (TAA TAG GAG TGA GTA TAG) and the first 88 
nucleotides of HGV, was used on one hand and one of the 
previously mentioned plasmids, which carries one of the 5' 
fragments of the genome, was used on the other hand. A 
complete HCV consensus genome was assembled from the 

35 subgenomic fragments with the smallest number of non- 
consensus replacements and inserted into a modified pBR322 
vector. Deviations from the consensus sequence were 
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eliminated by means of site-directed mutagenesis . In order 
to produce run-off transcripts with an authentic 3' end^ 
the 3* NTR of the isolates (with the end TGT) was modified 
to AGT (according to the sequence of the genotype 3 == clone 
5 'WS* according to Kolykhalov et al . , 1996, J. Virol. 70, 
3363) and an additional nucleotide replacement was carried 
out at position 9562, to preserve the A:T pairing in the 
hairpin structure at the 3' end of the 3' NTR (Kolyhalov et 
al. ibid.)- In order to eliminate an internal restriction 
10 site for the Seal enzyme, a silent nucleotide replacement 
was further carried out. After joining the full-length 
genome with the appropriate 5' and 3* NTRs the complete HCV 
sequence was analyzed. No undesired nucleotide replacement 
was detected. 

15 

The HCV genome produced in this way should be hepatotropic 
according to the definition. 

(B) Synthesis of selectable HCV subgenomic constructs 

20 

By means of the consensus genome described under (A) , HCV 
subgenomic constructs were created, which include the 
antibiotic resistance gene neomycin phosphotransferase 
(NPT) and two sequences of internal ribosome entry sites 

25 (IRES) . The biochemical procedures used for this are known 
and familiar to the expert (see: Sambrook, J., E.F, 
Fritsch, T. Maniatis, 1989, Molecularcloning : a laboratory 
manual, 2nd ed.. Cold Spring Harbour Laboratory, Cold 
Spring Harbor, N.Y.; Ausubel et al. (eds.), 1994, Current 

30 Protocols in Molecular Biology, Vol. 1-3, John Wiley & Sons 
Inc., New York). The antibiotic resistance gene was 
inserted immediately after the 5' NTR, obtaining a 
bicistronic RNA as a result (see Fig. 1 A) . However, the 
antibiotic resistance gene might as well be inserted at 

35 another site of the HCV subgenomic construct, for example 
within the nucleotide sequence encoding for the 
polyprotein, obtaining a monocistronic RNA as a result (see 
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Fig. 5 A) , or in the 3' NTR (see Fig. 5 B) . The IRES 
elements are one of the two HCV-IRES variants from 
nucleotides 1-377 or nucleotides 1-389 as well as the IRES 
of the encephalomyocardit is virus^ which controls the 
5 translation of the HCV sequence downstream from the genes 
for NS2 or NS3 up to the authentic 3' end of the genome. 

The two HCV-IRES variants mentioned were determined in the 
following way: 

10 

Based on deletion analyses of the 3' end of the HCV-IRES 
(Reynolds et al.l995, EMBO J. 14, 6010) various segments of 
the 5' NTR were fused with the NPT gene and analyzed with 
regard to the maximum number of colonies formed by means of 

15 co-transf ection with a plasmid containing the T7 RNA 
polymerase gene. The best results were achieved with the 
HCV sequences from 1-377 to 1-389. Due to the fact that the 
AUG start codon of the HCV polyprotein is located at 
position 342 and is consequently included in the IRES 

20 sequence, 12 or 16 amino acids of the HCV core protein 
("core protein") were fused with the neomycin 
phosphotransferase (see Fig. 1 A) . 

Accordingly, these modified HCV subgenomic constructs were 
25 given the designations I377/NS2-3' (or I377/NS3-3') and 
I389/NS2-3' (or I389/NS3-3 ' ) . They are illustrated in a 
diagram in Fig. lA. 

Different cell lines and primary cell cultures of human 
30 hepatocytes were transfected with in vitro transcripts of 
these modified parental HCV subgenomic constructs I377/NS2- 
3' (or I377/NS3-3') and I389/NS2-3' (or I389/NS3-3 ' ) . 

As a parallel negative control for all transfection 
35 experiments, an appropriately modified, but defective 
subgenome was constructed for every modified parental HCV 
subgenomic construct, which differs from the parental 
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construct due to the fact, that it has a deletion of 10 
amino acids within the reading frame comprising the active 
site of the NS5B RNA polymerase (Behrens et al., 1996, EMBO 
J. 15, 12; and Lohmann et al., 1997, J. Virol. 71, 8416). 

5 

(C) Synthesis of selectable HCV genomic constructs 

A NS2-3' subgenomic construct, linked at its 5* end to a 
fragment of the luciferase gene and the complete EMCV-IRES, 

10 was restricted with Ncol and Spel and purified using 
preparative agarose gel electrophoresis- The vector 
obtained in this way was ligated with a Ncol/Not I-HCV 
fragment corresponding to the nucleotide positions 342 to 
1968 of the HCV genome and a Notl/Spel fragment 

15 corresponding to the nucleotide positions 1968-9605 in a 3- 
f actor-ligation . The resulting construct, where the 
complete HCV open reading frame and the 3' NTR lie 
downstream from the luciferase gene fragment and the EMCV- 
IRES, was then restricted with Pmel and Spel and ligated 

20 with the analogously restricted I389/NS3-3 ' /wt subgenomic 
construct vector. This selectable HCV genomic construct is 
illustrated in Fig. 4, 

(D) Production of in vitro transcripts corresponding to the 
25 HCV-RNA constructs 

The previously described purified plasmid DNAs were 
linearized with Seal and used for an in vitro transcription 
reaction after phenol/chloroform extraction and isopropanol 

30 precipitation using the following components: 80 mM HEPES, 
pH 7.5, 12,5 mM MgCla, 2 mM Spermidine, 40 mM 
Dithiothreitol, 2 mM of each NTP, 1 unit RNasin/ial, 50 
pg/ml restricted DNA and approx. 2 units/pl T7 RNA 
polymerase. After 2 hrs. at 37 °C half of the amount of T7 

35 polymerase was added and the reaction mixture was incubated 
for two further hours. In order to remove DNA the mixture 
was extracted with acid phenole (U. Kedzierski, J.C. Porte, 
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1991, Bio Techniques 10, 210), precipitated with 
isopropanol, the pellet was dissolved in water and 
incubated with DNase (2 units per \ig DNA) for 60 min. at 37 
"C. After subsequent extraction with acid phenole, acid 
5 phenole/chlorof orm and chloroform as well as isopropanol 
precipitation the dissolved RNA was quantified using 
optical density measurement and its integrity was checked 
using f ormaldehyde-agarose gel electrophoresis. 



10 

Example 2 : Transfection experiment:s with the hepatoma cell 
line Huh 7 

With all transfection experiments it was carefully ensured 
15 that any template DNA had been removed beforehand so as to 
avoid the possibility of this DNA integrating in 
transfected cells and conferring a neomycin resistance to 
them independent from HCV replication. The reaction mixture 
was therefore treated with 2 units of DNase per pg DNA for 
20 60 min. at 37 °C after the in vitro transcription and 
extracted with acid phenole, acid phenole/chlorof orm and 
chloroform. Prior to being used for the transfection the 
precipitated RNA was analyzed using f ormaldehyde-agarose 
gel electrophoresis , 

25 

Three separate transfection experiments were carried out 
with the highly differentiated human hepatoma cell line Huh 
7 (according to Nakabayashi et al. 1982, Cancer Res. 42, 
3858) • Each time 15 pg RNA were introduced into 8 x 10^ Huh 

30 7 cells by electroporation and the cells were seeded in 
culture dishes with a diameter of 10 cm. 24 hours after 
seeding, neomycin (= G418) was added in a final 
concentration of 1 mg/ml. The culture medium was changed 
twice a week. After 3-5 weeks small colonies were 

35 visible, which were isolated and grown under the same 
culture conditions . 
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The cell clones obtained during the first experiment were 
isolated and subpassaged. Most of the clones died during 
this procedure, and the final yield was only 9 clones from 
cells, which had been transfected with the parental HCV 
5 subgenomic constructs and 1 clone (clone 8-1) from cells, 
which had been transfected with a defective HCV genomic 
construct, namely a defective NS2-3' HCV-RNA. Apart from an 
extended doubling time and the occasional occurrence of 
irregularly shaped cells, no consistent morphological 
10 differences were found between these 9 cell clones and the 
single cell clone (clone 8-1) or the parental Huh 7 cells. 

The main criteria for functioning HCV genomic constructs 
are the formation of viral RNA with the correct size and 
15 the absence of (integrated) plasmid DNA, which could 
transfer or mediate on a G418 resistance. 

To determine the HCV-RNA in the Huh 7 cells, the total RNA 
was isolated and analyzed by means of the common Northern 

20 Blot method using a plus-strand specific ribo probe (RNA 
probe) . For this purpose the total RNA was isolated from 
the respective cell clones according to the method 
described by Chomczynski and Sacchi 1987, Anal. Biochem. 
162, 156 and 10 ]jg RNA, which is equivalent to the total 

25 RNA content of 0,5 - 1 x 10^ cells, are separated using 
denatured f ormaldehyde-agarose gel electrophoresis (lanes 3 
to 12 of Fig. 1 B) . At the same time 10^ in vitro 
transcripts (ivtr.), which correspond to the I389/NS2-3 ' /wt 
or the I389/NS3-3 ' /wt replicon RNAs, are separated as well 

30 as size markers with authentic sequence (lane 1 or lane 2) . 
The separated RNA was transferred onto nylon-based 
membranes and hybridized with a radioactively labeled plus- 
strand specific RNA probe, which was complementary to the 
complete NPT gene and the HCV-IRES of nucleotide 377 to 

35 nucleotide 1. The positions of the HCV specific RNAs 
(arrows) and the 28S rRNA are specified to the right of 
lane 12, the size (amount of nucleotides) of the RNA marker 
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is specified to left of lane 1. The RNA marker fragments 
contain HCV sequences and therefore hybridize with the ribo 
probe (= RNA probe) . The results of this analysis are 
illustrated in Fig. 1 B. 

5 

With the exception of clone 8-1 transfected with the 
defective HCV genomic construct, all cell clones produced 
homogenous HCV-RNAs of correct lengths (approx. 8640 
nucleotides in the case of the NS2-3' and approx. 7970 

10 nucleotides in the case of NS3-3' replicon) - This result is 
an indication of the fact, that the functional replicons or 
the functional HCV genomic constructs transfer the G418 
resistance. In order to exclude, that the G418 resistance 
is caused by a plasmid DNA, which is integrated in the 

15 genome of the Huh 7 host cell and transcribed under control 
of a cellular promoter, the DNA of each clone was analyzed 
by means of a NPT gene specific PCR. Consequently the DNA 
was isolated from the selected Huh 7 cell clones by means 
of digestion with proteinase K (40vig/ml, Ih, 37 °C) in 

20 lOmMTris, pH7,5, ImM EDTA, 0,5 % SOS and subsequent 
extraction with phenol, phenol/chloroform and isopropanol 
precipitation. The DNA precipitate was dissolved in 10 mM 
Tris (pH 7,5) and 1 mM EDTA and incubated with Rnase A for 
1 hour. Following a phenol/chloroform extraction and 

25 ethanol precipitation, 1 ]ag DNA, equivalent to 4 - 8 x 10^ 
cells, was analyzed by means of PCR using NPT gene specific 
primers ( 5 ' -TCAAGACCGACCTG TCCGGTGCCC-3 ' and 5'- 

CTTGAGCCTGGCGAACAGTTCGGC-3 M f and a DNA fragment consisting 
of 379 nucleotides was generated. The specificity of the 

30 PCR product was established by means of the Southern Blot 
method, in which a DNA fragment labeled by digoxigenin was 
used, which corresponds to the NPT gene. As positive 
controls (for the detection of possibly contaminating 
nucleic acids) the PCR method was carried out with 10^ 

35 plasmid molecules or 1 ]ig DNA from a BHK cell line, which 
was stably transfected with a neomycin resistance gene, and 
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as negative control, the PGR was carried out with the same 
reagents but without added DNA. 

The results of this analysis are illustrated in Fig. 1 C. 
5 The lanes 1 and 2 represent the positive controls, lane 13 
represents the negative control. The numbers to the left of 
lane 1 indicate the sizes of the nucleotide marker 
molecules . 

10 A NPT-DNA could not be detected in any cell clone, apart 
from clone 7-3 (Fig.lC, lane 3), which was obtained from 
cells after transfection with a NS2-3 ' replicon/NS2-3 * HCV 
genomic construct, and clone 8-1 (Fig.lC, lane 12), which 
was obtained from cells after transfection with a defective 

15 HCV genomic construct. This result was another indication 
of the fact, that the G418 resistance of most clones was 
passed on by the replicating HCV-RNA. But even regardless 
of these results, it is unlikely, that integrated plasmid 
DNA produces HCV-RNAs of correct sizes, because the 

20 plasmids used for the in vitro transcription contain 
neither an eukaryotic promoter nor a polyadenylation 
signal. In the case of clone 7-3 the resistance is 
therefore very probably passed on by the HCV-RNA construct 
or the replicating HCV-RNA as well as by an integrated NPT 

25 DNA sequence, whereas the resistance of the cells of clone 
8-1 is only caused by the integrated plasmid DNA. 

Clone 9-13 (Fig. 1 B, lane 11) was subject to further 
tests, to confirm that the G418 resistance was passed on by 

30 a self-replicating HCV-RNA. Clone 8-1, which carries the 
integrated copies of the NPT gene, was used throughout as 
negative control. A PCR was carried out, which allows the 
detection of < 1000 NPT gene copies in 40.000 cells, with 
the aim to rigorously exclude the presence of NPT-DNA in 

35 clone 9-13. The result of this PCR is illustrated in Fig. 
2A. The PCR proceeded in detail as follows: 
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During the test, 10^ - 10^ plasmid molecules (I377/NS3- 
3Vwt) were used either directly (lanes 7 - 11) or after 
adding each 1 pg 9-13 DNA (lanes 2-6). The specificity of 
the amplified DNA fragment was determined by Southern Blot 
5 using a NPT specific probe. A PGR without DNA probe was 
carried out as negative control (lane 12). 

Even with this sensitive method, no plasmid DNA could be 
detected in one pg DNA of the cell clone 9-13 (lane 1) . To 

10 estimate the amount of HCV plus- and minus-strand RNAs in 
these cells, a dilution series of total RNA was analyzed 
with the Northern Blot method using a plus- or minus-strand 
specific radioact ively labeled ribo probe (= RNA probe) . 
For this purpose, 8, 4 or 2 pg of total RNA, which have 

15 been isolated from the cell clones 9-13 and 8-1, were 
analyzed in parallel to known amounts of in vitro 
transcripts with plus- or minus-strand polarity (control 
RNAs) in the Northern Blot method and were then subjected 
to a hybridization. The hybridization was carried out with 

20 a plus-strand specific ribo probe, which covered the 
complete NPT gene and the HCV-IRES ('plus-strand', top 
panel) , or with a minus-strand specific RNA probe, which 
was complementary to the NS3 sequence ('minus-strand', 
bottom panel) . The arrows mark the positions of replicon 

25 RNA. The results of this analysis are illustrated in Fig. 
2 B. 

Approx. 10^ copies/pg total RNA were detected in the case 
of the plus-strand, which is equivalent to 1000 - 5000 HCV- 
30 RNA molecules per cell, whereas the amount of minus-strand 
RNA was 5- to 10-times less. This result corresponds to the 
assumption, that the minus-strand RNA is the replicative 
intermediate form or intermediate copy, which is used as a 
template for the synthesis of the plus-strand molecules. 

35 

Due to the fact that the reaction is mainly catalyzed by 
the viral RNA-dependent RNA polymerase, the synthesis of 
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the HCV-RNAs should be resistant to dact inomycin, an 
antibiotic^ which selectively inhibits the RNA synthesis of 
DNA templates^ but not the RNA synthesis from RNA 
templates. To confirm this assumption, cells were incubated 
5 with [^H] uridine in the presence of dact inomycin, the 
radioactively labeled RNAs were extracted, separated by 
means of denaturing agarose gel electrophoresis and 
analyzed with the aid of a common Bio-Imager using a [^H]- 
sensitive screen. For this purpose approx. 5 x 10^ cells of 

10 the clones 9-13 and 8-1 were incubated at a time with 100 jj 
Ci ["^H] uridine for 16 hrs . in the absence (-) or presence 
( + ) of 4 ijg/ml of dactinomycin (Dact) . Following this 
labeling reaction the total RNA was prepared and analyzed 
by means of f ormaldehyde-agarose gel electrophoresis. Only 

15 1/10 of the total RNA is illustrated in the first two 
lanes. The radioactively labeled RNA was visualized using a 
BAS-2500 Bio-Imager (Fuji) . 

The results of this analysis are illustrated in Fig. 2 C- 
20 In accordance with the inhibitor profile of the NS5B 
polymerase (Behrens et al., 1996, EMBOJ. 15, 12 and Lohmann 
et al., 1997, J. Virol, 71, 8416), dactinomycin had no 
influence on the replication of the HCV RNA, whereas the 
synthesis of cellular RNA was inhibited- A RT-PCR was 
25 carried out for recloning the replicating sequences, to 
confirm the identity of the viral RNA. The sequence 
analysis of the recloned RNA showed that the RNA in clone 
9-13 is HCV specific and corresponds to the transfected 
transcript of the HCV construct I377/NS3-3 ' /wt . 

30 

For the analysis of the viral proteins, at first the 
respective cells were metabolically radioactively labeled 
with ['^^S] methionine/cysteine, then lysed and afterwards 
the HCV specific proteins were isolated from the cell 
35 lysates by means of immunoprecipitation . The results of 
these analyses are illustrated in Fig. 3 A, The detailed 
procedure was as follows: Cells of the cell clones 9-13 
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(wt) and 8-1 (A) were metabolically radioact ively labeled 
by treating them with a protein labeling mixture familiar 
to the expert and available on the market (for example. NEN 
Life Science) . The HCV specific proteins were separated 
5 from the cell lysate by immunoprecipitation (IP) under non- 
denaturing conditions (for example according to 
Bartenschlager et al., 1995, J. Virol. 69, 7519) using 
three different antisera (3/4, 5A, 5B, according to the 
labeling on the top end of the lanes 1 to 12) . The immune 

10 complexes were analyzed by means of tricine SDS-PAGE and 
made visible by means of autoradiography. To obtain 
authentic size markers, the homologous replicon construct 
I377/NS3-3 Vwt was subject to a transient expression by the 
vaccinia virus-T7 hybrid system in the Huh 7 cells. The 

15 resulting products were used as size markers (lanes 7-9) 
parallel to the cells of the clones 9-13 and 8-1. 
Identified HCV proteins are labeled on the left edge of 
lane 1, the molecular weights (in Kilodalton) are specified 
on the right edge of lane 9. It should be noted that the 

20 NS3/4 specific antiserum ('3/4*) used preferably reacts 
with NS4A and NS4B causing an underrepresentation of NS3. 

All viral antigens could unambiguously be detected, and 
their apparent molecular weights did not show any 

25 difference to those being detected after transient 
expression of the same bicistronic HCV-RNA construct in the 
original Huh 7 cells. An immunofluorescence detection 
reaction was carried out using NS3 and NS5A specific 
antisera, to determine the subcellular distribution of the 

30 viral antigens (for example according to Bartenschlager et 
al., 1995, J. Virol. 69, 7519). For this purpose cells of 
the clones 9-13 (wt) and 8-1 (A) were fixed with 
methanol/acetone 24 hrs . after incubating on coverslips and 
incubated with polyclonal NS3 or NS5A specific antisera. 

35 The bound antibodies were made visible with a commercially 
available FITC conjugated anti-rabbit antiserum. The cells 
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were counters tained with the 'Evans Blue* stain to suppress 
unspecific fluorescence signals. 

The results of this detection test are illustrated in Fig. 
5 3 B. A strong fluorescence in the cytoplasm could be 
detected with both antisera. The NS5A specific antiserum 
also caused a slight nuclear fluorescence, which indicates 
that at least small amounts of this antigen also reach the 
nucleus. But the generally dominating presence of the viral 
10 antigens in the cytoplasm are a strong indication that HCV- 
RNA replication occurs in the cytoplasm, as is the case 
with most RNA viruses. 

These results prove clearly that the establishment of a 
15 cell culture system for the HCV could be accomplished with 
the test arrangement described, the efficiency of which 
surpasses everything known up until now by far and for the 
first time allows the detection of viral nucleic acids and 
proteins with conventional and approved biochemical 
20 methods. This efficiency actually allows detailed 
examination of HCV pathogenesis, genetic analyses of 
different HCV functions and a precise study of the 
virus/host cell interaction, through which new starting 
points for the development of a antiviral therapy can be 
25 defined. 

Example 3 : Transfecbxon of Huh 7 cells with HCV totial 
cons-brucbs 

30 

HuhV cells are transfected and selected as described in 
example 2, whereby in this case selectable constructs are 
used, which contain the complete virus genome. 
Corresponding to Example 2, the resulting cell clones are 
35 tested for the absence of HCV-DNA by means of PCR and the 
productive replication of HCV-RNA is then established by 
means of Northern Blot, ["^H] uridine labeling in the 



CA 02303526 2000-03-31 

- 50 - 

presence of dact inomycin, detection of the viral proteins 
or antigens preferably with the aid of the Western Blot, 
the immunoprecipitation or immunofluorescence. In contrast 
to the arrangements described in Example 2, the construct 
5 described here makes it possible to obtain more complete 
and very likely infectious viruses, which has not been the 
case in the subgenomic constructs described in Example 2. 
These viruses existing in the cell and the cell culture 
supernatant are concentrated for example by means of 

10 ultracentrif ugation, immunoprecipitation or precipitation 
with polyethyleneglycol, and all exogenous nucleic acids, 
which means those that are not incorporated into the virus 
particle^ are digested by incubation with nucleases (RNase, 
DNase, micrococal nuclease) . In this way all contaminating 

15 nucleic acids, which are not included in the protecting 
virus particle, can be removed. After inactivation of the 
nucleases, the protected viral RNA is isolated for example 
by means of incubation with proteinase K in a buffer 
containing SDS, by extracting with phenol and 

20 phenol/chloroform and detected by means of Northern Blot or 
RT-PCR using HCV specific primers. Also in this test 
arrangement, the combination of the HCV consensus genome 
described with a selection marker was crucial for the 
efficient production of viral RNA, viral protein and 

25 therefore HCV particles. 

Example 4 : Production and application of a HCV- 
PNA construct, whereby the resistance gene is 
30 linked to the HCV subgenomic seq[uence via a 

ribozyme or a recognition site for a ribozyme. 

A HCV-RNA construct is produced according to Example 1 or 
Example 3, where an antibiotic resistance gene is linked to 
35 the HCV-RNA sequence through a ribozyme or a recognition 
sequence for a ribozyme. These constructs are illustrated 
in a diagram in Fig. 7. Huh 7 cells are transfected with 
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this HCV-RNA construct as described in Example 2. A 
selection with the appropriate antibiotic follows the 
transfection into the cells. The inserted ribozyme is 
activated in the cell clones obtained in the procedure or, 
5 in the case of a construct, which carries a recognition 
sequence for a ribozyme, the ribozyme is transfected into 
the cell (for example by means of transfection of a 
ribozyme construct or infection with a viral expression 
vector, into which the respective ribozyme has been 

10 inserted) . In both cases the resistance gene is separated 
from the HCV-RNA sequence by the ribozyme-dependent 
cleavage. The result in the case of the HCV genome is an 
authentic HCV genome without a resistance gene, which can 
form authentic infectious virus particles. A HCV replicon 

15 without resistance gene is created in the case of the HCV 
subgenomic constructs . 

Example 5 : Co- transfection of a HCV-RNA construct with a 
20 separate luciferase transfection construct 

A HCV-RNA construct is produced according to Example 1 (A) 
or Example 3 or Example 4. At the same time a transfection 
construct is produced, which comprises the luciferase gene, 

25 whereby this luciferase gene is linked to a first 
nucleotide sequence, which encodes a HCV protease (for 
example NS3 protease) cleavage site, to a second nucleotide 
sequence, which encodes for another protein or a part of 
another protein. HCV-RNA construct and transfection 

30 construct are transfected into any host cells, preferably 
hepatoma cells, most preferably Huh 7 cells. This can be 
realised as described in Example 2. The product of the 
modified luciferase gene is a luciferase fusion protein, 
where the luciferase is inactivated due to the fusion with 

35 the foreign part. The fusion protein, which contains a 
recognition sequence for a HCV protease, is cleaved in 
transfected cells with high HCV replication, and 
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consequently the active form of the luciferase, which can 
be identified through luminomet r ic measurement^ is 
released. If the replication of the HCV-RNA construct is 
inhibited the fusion protein will not be cleaved and no 
5 active luciferase will be released. The quantitative 
determination of the luciferase is therefore a measure for 
the replication of the HCV construct. Instead of the 
luciferase gene, another reporter gene can just as easily 
be used, which is modified in the same way, so that its 

10 expression depends on the viral replication, although this 
reporter gene is not part of the HCV construct- A cellular 
protein, which is deactivated or activated by the HCV 
proteins or nucleic acid, can also be used as a so called 
surrogate marker. The expression or activity of this 

15 surrogate marker is in this case a measure for the 
replication of the viral DNA. 



Example 6 : Producbion of HCV subgenomxc constructs with 
20 integrated foreign genes to be used as liver 

cell specific vector in gene therapy 



These recombinant and selectable HCV subgenomic constructs 
are transfected in transcomplementing helper cell lines, 

25 which means in cell lines, which express the missing 
functions (for example the structural proteins) in an 
inducible or constitutive way. Cell clones containing a 
functional HCV subgenomic construct can be established 
through appropriate selection. The viral structural 

30 proteins expressed from the host cell allow the formation 
of virus particles, into which the RNA of the HCV 
subgenomic constructs will be transfected. The results are 
therefore virus-like particles, which contain a HCV 
subgenomic construct according to the invention including 

35 the inserted foreign gene and which can transmit this to 
other cells by means of infection. An example for this 
construct is illustrated in Fig. 8. 
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It is also possible to use the HCV subgenomic construct 
with integrated foreign gene directly as an expression 
vector. This involves the same method as mentioned 
5 previously except that cell lines, which do not express 
transcomplementing factors, are transfected. In this case 
the HCV construct is only used as an expression vector. 

10 Example 7 : Production of cell culture adapted HCV-RNA 

constructs 

(A) Method of Isolation 

15 The following method was used to determine adaptive 
mutations and to produce cell culture adapted HCV-RNA 
constructs: cells were transfected with a HCV-RNA construct 
as described in Examples 1 and 2 and G418-resistant cell 
clone produced. For the determination of ability to 

20 replicate (understood in this context to be the number of 
G418 resistant cell clones obtained per microgram of 
transfected HCV-RNA or HCV-RNA construct), the total RNA 
from one of the cell clones, [the 9-13 clone (Fig. 1 B, 
lane 11)], was isolated and the quantity of HCV-RNA 

25 contained within it was determined by Northern Blot as 
described in Fig. 2 B. Ten micrograms of the total RNA, 
containing approx. 10^ molecules of HCV-RNA, was then 
transfected into naive Huh~7 cells using elect roporat ion 
(Fig. 9) . In parallel, 10^ in vitro transcripts of the 

30 analoguous neo-HCV-RNA, which had been adjusted with 
isolated total RNA from naive Huh-7 cells to a total RNA 
quantity of 10 ^g, were transfected in naive Huh-7 cells. 
After selection with G418, the number of cell colonies, 
expressed as 'colony forming units (cfu) per microgram 

35 RNA', was determined in both these cultures. At a 
concentration of 500 pg/ml G418 in the selection medium, 
the number of colonies obtained with the HCV-RNA contained 
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in Isolated total RNA from clone 9-13, was approx. 100,000 
cfu per microgram HCV-RNA. In contrast, only 30 - 50 
colonies were obtained with the same quantity of in vitro 
transcribed HCV-RNA. This result confirms that the specific 
5 infectivity of the HCV-RNA isolated from the cell clones is 
approx. 1,000 - 10,000 times higher than the infectivity of 
the analoguous in vitro transcripts. The experimental 
approach is shown in Fig. 9. 

10 With the aid of 'long-distance RT-PCR', the HCV-RNA was 
amplified from the total RNA of the 9-13 cells, the PGR 
amplificate was cloned and numerous clones were sequenced. 
A comparison of the sequences of these recloned RNAs with 
the sequence of the RNA originally transfected into the 

15 naive Huh-7 cells, showed that the recloned RNAs possessed 
numerous amino acid exchanges distributed over the whole 
HCV sequence (Fig. 10) . Sfil fragments of these recloned 
mutants were used to replace the analoguous Sjfil fragment 
of the original replicon construct, and RNAs of the 

20 respective mutants were transfected in naive Huh-7 cells. 
After selection with G418 the number of colonies created 
was determined for each HCV-RNA mutant. While only 30 - 50 
colonies per microgram RNA were obtained with the parental 
RNA the number of colonies was noticeably higher for two of 

25 the recloned variants (Fig. 10) . In the case of the HCV-RNA 
constructs 9-131 and 9-13C the specific infectivity was 
increased to 100 - 1, 000 cfu per microgram RNA and for 9- 
13F replicon it was 1, 000 - 10, 000 cfu per microgram RNA. 
These results show that the amino acid exchanges in the 

30 analysed NS3-5B regions of the mutants 9-131, 9-13C and 
particularly of 9-13F, led to a considerable increase in 
ability to replicate. In contrast all the other HCV-RNA 
constructs (9-13 A, B, G, H und K) were no longer able to 
replicate, they thus contained lethal mutations. 

35 

In order to answer the question which of the amino acid 
exchanges in the 9-13F construct led to an increase in 
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replication, the exchanges were introduced separately or in 
combination into the parental HCV-RNA construct, and the 
corresponding RNAs transfected in naive Huh-7 cells. The 
result of the transfection with these RNAs is summarised in 
5 Table 1. From this it is evident that in the present 
example the high ability to replicate is determined by 
several mutations. The amino acid exchanges in the HCV-RNA 
regions NS5A and NS4B make the greatest contribution. The 
single exchanges in the NS3-Region also make a contribution 
10 and perhaps they are synergistic. 

These results confirm that it was through the G418 
selection of the cells transfected with the neo-HCV-RNA 
construct that there was enrichment of those HCV-RNAs 

15 having noticeably higher ability to replicate. HCV-RNA 
constructs with greatly differing replication efficiencies 
can be selected using the experimental approach described 
here. The higher the concentration of the antibiotic in the 
selection medium, in/on which the HCV-RNA construct 

20 containing cells are cultivated for selection, the higher 
must be the extent of adaptive mutations and hence 
replication efficiency of the relevant HCV-RNA constructs^ 
to allow the cells to grow under these conditions. If the 
selections are carried out using lower antibiotic 

25 concentrations, cells can survive and multiply, but the 
HCV-RNA construct shows a comparatively lower replication 
efficiency and fewer adaptive mutations. 

As has been shown, the 9-13F HCV-RNA construct described so 
30 far, which contains several adaptive mutations, had a 
higher replication efficiency than the parental HCV-RNA. In 
order to obtain HCV-RNAs with even higher replication in 
cell culture, the HCV-RNA contained in the total RNA of a 
selected cell clone was passaged several times in naive 
35 Huh-7 cells. The selected 5-15 cell clone, was obtained by 
transfection with the HCV-RNA construct I389/NS3-3' (Fig. 
1) • It largely corresponds to the cell clone 9-13, produced 
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by transfection with a HCV-RNA construct, having a HCV-IRES 
shorter by 22 nucleotides ( I377/NS3-3 ' ; Fig. 1). Ten 
micrograms of total RNA, isolated from cell clone 5-15, 
were transfected into naive Huh-7 cells using 
5 electroporation and the cells subjected to a selection with 
1 mg/ml G418. The total RNA from one of the cell clones 
thus produced was again isolated, transfected into naive 
Huh-7 cells and selected in the same way. This process was 
repeated a total of four times- After the fourth passage 

10 the total RNA was isolated from a cell clone and the neo- 
HCV-RNA amplified with the aid of the 'long-distance RT- 
PCR' . The amplified DNA fragment was digested with the 
restriction enzyme Sfil and inserted into the 5:fil- 
restricted parental construct I389/NS3-3'. Over 100 DNA 

15 clones were obtained altogether and then analysed by means 
of restriction digestion. In vitro transcribed RNA of about 
80 of these clones was each transfected into naive Huh-7 
cells and subjected to a selection with 500mg/ml G418. Of 
the 80 neo-HCV-RNA variants examined, the great majority 

20 proved to be replication defective. However, the specific 
infectivity, expressed as 'colony forming units' per 
microgram RNA, was noticeably increased in the case of two 
mutants, 5.1 and 19 (Table 2). Through several passages of 
the RNA in cell culture it is clear that HCV-RNAs are 

25 produced whose replication efficiency due to mutations 
(^adaptive mutations'') is several orders of magnitude 
higher than the original RNA cloned from patients. 

(B) Modified method 

30 

Adaptive mutations produced and identified according to (A) 
can be transferred into a HCV-RNA construct with low 
ability to replicate. This leads to a huge increase in the 
replication of this construct. The increase is so great it 
35 can be demonstrated that HCV-RNAs transfected into cell 
culture can replicate even in the absence of selection 
pressure Fig. 12 shows a comparison of the replication 
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efficiency of HCV~RNAs, which corresponded either to the 
starting sequence or to the adaptive sequences 9-13F or 
5.1. For the purposes of simple measurement, the neo-gene 
was removed and replaced by the gene for lucif erase. The 
5 negative control used was again a HCV-RNA construct that 
was replication defective due to a deactivating mutation in 
the NS5B RNA polymerase. Already 24 hours after 
transfection a noticeable difference is evident in 
luciferase activity between the defective RNA and the 9-13F 

10 or 5.1 constructs, while hardly any difference could be 
seen between the defective RNA (318 DN) and the parental 
RNA construct (wt) that possessed no adaptive mutations. 
During the whole period of observation, the highest 
luciferase activity, and thus highest replication, was 

15 obtained with the 5.1-RNA. These results not only confirm 
the high replication efficiency of this RNA, but also show 
that it is possible to create a cell culture system with 
adapted HCV-RNA constructs for which the presence of a 
selectable gene is no longer necessary- A summary of the 

20 nucleotide and amino acid differences between the starting 
construct and the mutants 9-13F, 5,1 and 19 is presented in 
Table 3. 

25 Example 8: Production of cell culture adapted HCV--RNA 

full-length genome 

In the examples 1 to 7 a subgenomic HCV-RNA was used which 
lacked the whole structural protein region from core up to 

30 p7 or even NS2, It will be shown in this example that it is 
possible to make a HCV full-length genome replicate in cell 
culture with the aid of an adapted NS3-5B sequence. For 
this purpose the 5-fiI fragment of the highly adapted HCV- 
RNA 5.1 produced according to Example 7 is first 

35 transferred into a selectable HCV full-length genome (Fig, 
12) . This HCV genome was transfected into naive Huh-7 cells 
and subjected to selection with various G418 
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concentrations. Depending on the strength of selection (the 
G418 concentration), a varying large number of cell clones 
was obtained (Fig. 12 B) . By contrast no colonies were 
obtained with the parental HCV full-length genome 
5 containing no adaptive mutations, as was the case for the 
negative control, which was replication defective due to a 
deactivating mutation in the NS5B RNA polymerase. To 
confirm that the thus resulting cell clones really 
contained an autonomously replicating HCV full-length 

10 construct, total RNA from several cell clones was isolated 
and analysed by means of the Northern Blot method. The 
full-length HCV-RNA was clearly detectable in all cell 
clones (Fig. 12). It is thus clearly confirmed, that with 
the aid of cell culture adapted HCV sequences it is 

15 possible to produce a HCV full-length genome, which 
replicates highly efficiently and autonomously in a cell 
line, i.e. adapted HCV full-length genomes can also be 
produced with the system of the invention. Furthermore, as 
this clone possesses the complete HCV sequence, i.e. it 

20 also possesses the structural proteins necessary for virus 
particle formation, it is possible to produce large 
quantities of infectious virus particles in cell cultures 
with this system. As a confirmation of these viruses, cell- 
free cell supernatants carrying a replicating HCV full- 

25 length genome, are added to naive Huh-7 cells and the thus 
infected cells subjected to selection with G418. Each cell 
clone growing under these conditions originates from an 
infected cell. The viruses in the cell culture supernatant 
of cells possessing a replicating HCV full-length genome 

30 can be enriched and purified using various known state of 
the art methods such as ultracentrif ugation or 
microdialysis . They can then be used for the infection of 
naive cells. Using this method it is clearly demonstrated 
that cell culture adapted full-length genomes can be 

35 produced with the HCV cell culture system of the invention. 
These genomes replicate with high efficiency in cells and 
produce infectious viruses. The latter can be detected by 
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infection of an experimental animal, preferably a 
chimpanzee . 

5 Example 9 : Production of HCV full-length constructs and 

HCV subgenomxc constructs with reporter genes 

A HCV-RNA construct is produced in which a reporter gene is 
inserted in place of the antibiotic resistance gene (Fig. 

10 13) . Replication can thereby be determined through the 
quantity or activity of the reporter gene or reporter gene 
product. The reporter gene is preferably a gene from the 
group of the luciferase genes, the CAT gene 
(chloramphenicol acetyl transferase gene) , the lacZ gene 

15 (beta galactosidase gene) , the GFP gene (green fluorescence 
protein gene) , the GUS gene (glucuronidase gene) or the 
SEAP gene (secreted alkaline phosphatase gene) . This 
reporter gene and its products, namely the relevant 
reporter proteins, can be detected for example using 

20 fluorescence, chemiluminescence, colorimetrically or by 
means of immunological methods (for example, enzyme-linked 
immunosorbent assay, ELISA) . 

The reporter gene can be expressed either from a separate 
IRES or in the form of a fusion protein, which is active 
25 either as such or fused with a HCV protein via a 
poteolytically cleavable amino acid sequence in such a way 
that the reporter is separated from the HCV protein by 
cleavage of a cellular or viral (HCV) protease. 

30 

Example 10 : Production of HCV full-length constructs with 
Integrated foreign genes used as liver cell 
specific vectors for gene therapy or as 
expression vectors . 

35 

The construct (Fig. 14) is transfected in cells and leads 
to the formation of HCV virus particles that can be used 
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for the infection of further cells. Since the virus 
particles have encapsidated RNA with a foreign gene, it can 
be used in the infected cells for the production of the 
protein coded by this foreign gene. Cells transfected with 
5 the construct also express the foreign gene. 

Example 11: Production of monocis-tronic HCV-RNA constructs 
xn which the resistance gene product is 
10 expressed as a fusion protein with the HCV 

portion . 

It is an advantage for some tests if the HCV-RNA construct 
does not possess a heterologous IRES element. Tests of this 

15 type are, for example, the determination of interferon 
resistance. If a cell possessing a HCV-RNA construct is 
incubated with interferon alpha or beta, a reduction in 
replication of the HCV-RNA results. In order to explain the 
mechanism of this effect it is necessary for the HCV-RNA 

20 construct not to possess any heterologous IRES, as 
otherwise it is not possible to determine whether the 
interferon mediated inhibition is via inhibition of the HCV 
replication or inhibition of the heterologous IRES. For 
this reason constructs are produced for which the 

25 resistance gene is fused with a HCV protein (Fig. 15) . 
Either the fusion protein is active as such or the 
resistance gene product is linked to a HCV protein via a 
proteolytically cleavable amino acid sequence in such a way 
that it is separated from the HCV protein by a cellular or 

30 viral (HCV) protease. 
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Table 1: Specific inf activities (cfu/pg RNA) of HCV RNA 
constructs with adaptive mutations found with the 9-13F 
mutant introduced into the parental construct I389/NS3-3 ' /wt 



amino acid exchange*^ 


HCV protein 


cfu/ug RNA^ 


none 




30-60 


1283 arg -> gly 


NS3 


200 - 250 


1383 glu -> ala 


NS3 


30 - 60 


1577 lys -> arg 


NS3 


30 - 60 


1609 lys -> glu 


NS3 


160 - 300 


(1283 arg gly + 1383 glu - 
> ala + 1577 lys ~> arg + 
1609 lys -> glu) 


NS3 


360 - 420 


1936 pro -> ser 


NS4B 


1000-5000 


2163 glu -> gly 


NS5A 


1000-5000 


2330 lys -> glu 


NS5A 


30 - 60 


2442 ile -> val 


NS5A 


30 - 60 


all together 




5000 



amino acid change in the polyprotein of the HCV isolate 
con 1 (EMBL-gene bank No. AJ238799) ; amino acids are given 
in single letter code. 

^ Colony forming units (number of cell clones) obtained 
10 with a selection of 500 ]ag/ml G418. 
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Table 2: Specific inf activities (cfu/]jg RNA) of the 
parental HCV RNA construct I389/NS3-3 ' /wt and the variants 

9-131, 9-13F, 5.1 and 19. 



Transfected RNA variant 


cfu/pg RNA^ 


wild type 


30 - 50 


9-13 I 


100 - 1.000 


9-13 F 


1.000 - 10.000 


5 . 1 


50.000 - 100.000 


19 


50.000 - 100.000 



^ Colony forming units (number of cell clones) obtained 
with a selection of 500jjig/ml G418. 
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Table 3: Nucleotide and amino acid sequence differences 
between the parental HCV RNA construct laes/NSS-S ' /wt and 
the mutants 9-131, 9-13F, 5.1 und 19 



Clone 


nt-position 


n t - ex change 


aa-exchange 


9-13 I 


3685 


C > T 


P > L 




4933 


C > T 


T > M 




5249 


T > C 






8486 


C > T 






8821 


G > A 


W > stop 




8991 


C > G 


R > G 




9203 


A > G 






9313 


T > C 


F > S 




9346 


T > C 


V > A 


9-13 F 


3866 


C > T 






4188 


A > G 


R > G 




4489 


A > C 


E > A 




4562 


G > A 






4983 


T > C 






5071 


A > G 


K > R 




5166 


A > G 


K > E 




6147 


C > T 


P > S 




6829 


A > G 


E > G 




7329 


A > G 


K > E 




7664 


A > G 


I > V 




8486 


C > T 






o ri r\ T 

8 9 91 


C > G 


R > G 


NK5. 1 


4180 


C > T 


T > I 




4679 


C > T 






4682 


T > C 






5610 


C > A 


L > I 




6437 


A > G 






6666 


A > G 


N > D 
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6842 


C > T 






6926 


C > T 


_ 




6930 


T > C 


S > P 




7320 


C > T 


P > s 




7389 


A > G 


K > E 


NK19 


3946 


A > G 


E > G 




4078 


C > G 


A > G 




4180 


C > T 


T > I 




4682 


T > C 







5610 


C > A 


L > I 




5958 


A > T 


M > L 




6170 


T > A 


— 




6596 


G > A 






6598 


C > G 


A > G 




6833 


C > T 


_ 




6842 


C > T 


— 




6930 


T > C 


S > P 




7141 


A > G 


E > G 




7320 


C > T 


P > S 




7389 


A > G 


K > E 




7735 


G > A 


S > N 



Given are the differences between the nucleotide and amino 
acid sequences of the parental HCV RNA sequence con 1 
(EMBL-gene bank No. AJ238799) and those of the cell culture 
adapted HCV RNAs . Numbers refer to the nucleotide and amino 
acid positions of the con 1 isolate, nt, nucleotide; aa, 
amino acid. 
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SEQUENCE LISTING 

<110> Bartenschlager, Ralf 

<120> Hepatitis C Virus cell culture system 

<130> ba-1 

<140> 199 15 178.4 
<141> 1999-04-03 

<160> 11 

<170> Patentin Ver. 2.1 

<210> 1 
<211> 11076 
<212> DNA 

<213> Hepatitis C Virus 



<400> 1 

gccagccccc 

tcttcacgca 

cccccctccc 

gacgaccggg 

gcgagactgc 

gtgct tgcga 

ctcaaagaaa 

acgcaggttc 

caatcggctg 

ttgtcaagac 

cgtggctggc 

gaagggactg 

ctcctgccga 

cggctacctg 

tggaagccgg 

ccgaact gtt 

atggcgatgc 

actgtggccg 

ttgctgaaga 

ctcccgattc 

cagaccacaa 

aacgttactg 

tccaccatat 

acgagcattc 

gtgaaggaag 

tgcaggcagc 

taagatacac 

gaaagagtca 

gtaccccatt 

tcgaggttaa 

cacgataata 

cgccgcccac 

ttgccgcgca 

caacctcgtg 

gctcagcccg 

ctcctgtcac 

cgcaatttgg 

attccgctcg 

gttctggagg 

ttccttttgg 

gtatccggag 

gcggacatga 

cgctgctggg 

acgatacgac 

gtgggagatc 

cggcacgaga 



gat tgggggc 
gaaagcgt ct 
gggagagcca 
tcctttcttg 
tagccgagta 
gtgccccggg 
aaccaaacgt 
tccggccgct 
ctctgatgcc 
cgacctgtcc 
cacgacgggc 
gctgctattg 
gaaagtatcc 
cccattcgac 
tcttgtcgat 
cgccaggct c 
ctgct tgccg 
gctgggtgtg 
gcttggcggc 
gcagcgcatc 
cggtttccct 
gccgaagccg 
tgccgtcttt 
ctaggggtct 
cagttcctct 
ggaacccccc 
ctgcaaaggc 
aatggctctc 
gtatgggatc 
aaaacgtcta 
ccatgggcac 
aggacgtcaa 
ggggccccag 
gaaggcgaca 
ggtacccctg 
cccgtggctc 
gtaaggtcat 
tcggcgcccc 
acggcgtgaa 
ctttgctgtc 
tgtaccatgt 
tcatgcatac 
tagcgctcac 
gccatgtcga 
tctgcggatc 
cagtacagga 



gacactccac 

agccatggcg 

tagtggtctg 

gatcaacccg 

gtgttgggtc 

aggtctcgta 

aacaccaacg 

tgggt ggaga 

gccgtgttcc 

ggtgccctga 

gttccttgcg 

ggcgaagtgc 

atcatggctg 

caccaagcga 

caggatgatc 

aaggcgcgca 

aatat catgg 

gcggaccgct 

gaatgggctg 

gccttctatc 

ctagcgggat 

cttggaataa 

tggcaatgtg 

ttcccctctc 

ggaagcttct 

acctggcgac" 

ggcacaaccc 

ctcaagcgta 

tgatctgggg 

ggccccccga 

gaatcctaaa 

gttcccgggc 

gttgggtgtg 

acctatcccc 

gcccctctat 

tcggcctagt 

cgataccctc 

cctagggggc 

ctatgcaaca 

ctgtttgacc 

cacgaacgac 

ccccgggtgc 

tcccacgctc 

tttgctcgtt 

tgttttcctc 

ctgcaattgc 



catagatcac 
ttagtatgag 
cggaaccggt 
ctcaatgcct 
gcgaaaggcc 
gaccgtgcac 
ggcgcgccat 
ggctattcgg 
ggctgtcagc 
atgaactgca 
cagctgtgct 
cggggcagga 
atgcaatgcg 
aacat cgcat 
tggacgaaga 
tgcccgacgg 
tggaaaatgg 
at caggacat 
accgcttcct 
gccttcttga 
caattccgcc 
ggccggtgtg 
agggcccgga 
gccaaaggaa 
tgaagacaaa 
'aggtgcctct 
cagtgccacg 
ttcaacaagg 
cctcggtgca 
accacgggga 
cctcaaagaa 
ggtggtcaga 
cgcgcgacta 
aaggctcgcc 
ggcaatgagg 
tggggcccca 
acgtgcggct 
gctgccaggg 
gggaatctgc 
atcccagctt 
tgct ccaacg 
gtgccctgcg 
gcggccagga 
ggggcggctg 
gtcgcccagc 
tcaatatatc 



tcccctgtga 
tgtcgtgcag 
gagtacaccg 
ggagatttgg 
ttgtggtact 
catgagcacg 
gattgaacaa 
ctatgactgg 
gcaggggcgc 
ggacgaggca 
cgacgttgtc 
tctcctgtca 
gcggctgcat 
cgagcgagca 
gcatcagggg 
cgaggatctc 
ccgcttttct 
agcgttggct 
cgtgctttac 
cgagttcttc 
cctctccctc 
cgtttgtcta 
aacctggccc 
tgcaaggtct 
caacgtctgt 
gcggccaaaa 
ttgtgagttg 
ggctgaagga 
catgctttac 
cgtggttttc 
aaaccaaacg 
tcgtcggtgg 
ggaagacttc 
agcccgaggg 
gcttggggtg 
cggacccccg 
tcgccgatct 
ccctggcgca 
ccggttgctc 
ccgcttatga 
caagcattgt 
ttcgggagaa 
acgctagcgt 
ctctctgctc 
tgttcacctt 
ccggccacgt 



ggaactactg 
cctccaggac 
gaattgccag 
gcgtgccccc 
gcctgatagg 
aatcctaaac 
gatggattgc 
gcacaacaga 
ccggt tct tt 
gcgcggctat 
actgaagcgg 
tctcaccttg 
acgcttgatc 
cgtactcgga 
ctcgcgccag 
gtcgtgaccc 
ggattcatcg 
acccgtgata 
ggtatcgccg 
tgagtttaaa 
ccccccccct 
tatgttattt 
tgtcttcttg 
gttgaatgtc 
agcgaccctt 
gccacgtgta 
gatagttgtg 
tgcccagaag 
atgtgtttag 
cttt gaaaaa 
taacaccaac 
agtttacctg 
cgagcggtcg 
tagggcctgg 
ggcaggatgg 
gcgtaggtcg 
catggggtac 
tggcgtccgg 
cttttctatc 
agtgcgcaac 
gtatgaggca 
caactcctcc 
ccccactacg 
cgctatgtac 
ctcgcctcgc 
gacaggtcac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 
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cgtatggctt 

t tactccgga 

gcgggccttg 

ctctttgccg 

ctcgggatta 

aacggcagct 

ttccttgctg 

gccagctgca 

tcacacagct 

gt acccgcgg 

gggacgaccg 

ctgctt ctta 

agcactgggt 

aaaaccttga 

tgtggttcgg 

cactacccct 

gagcacaggc 

agggacagat 

tgttccttca 

gtggacgtac 

gagtatgtcc 

atgatgctgc 

gcatccgtgg 

tacatcaagg 

ctcct gctcc 

tcgtgcggag 

ctgttcctcg 

ttgcaagtgt 

acgtgcgcga 

ggtccactca 

gggctcattc 

gctctcatga 

cgggactggg 

tctgatatgg 

atcttgggcc 

agccttgaag 

cgaggcctac 

ggggaggtcc 

gtgtgttgga 

atcacccaaa 

gcgcgttcct 

gccgatgtca 

cccgtctcct 

gtgggcatct 

cccgtcgagt 

ccggccgtac 

agcactaagg 

tccgtcgccg 

aacatcagaa 

ggcaagtttc 

gagtgccact 

gagacggctg 

gtgccacatc 

ggcaaagcca 

aagaagaaat 

tattaccggg 

acggacgctc 

tgtgtcaccc 

gtgccacaag 

ggcatttaca 

ctgtgcgagt 

gttaggttgc 

ttctgggaga 

aagcaggcag 

gctcaggctc 

acgctgcacg 

accacacacc 



gggatatgat 

t cccacaagc 

cctactattc 

gcgttgacgg 

cgtccctctt 

ggcacatcaa 

cgctgt tcta 

gccccatcga 

cggaccagag 

cgcaggtgtg 

accggttcgg 

acaacacgcg 

tcaccaagac 

cctgccccac 

ggccttggtt 

gcactgtcaa 

tcgaagccgc 

cagagcttag 

ccaccctacc 

aatacctgta 

tgttgctctt 

tgatagctca 

ccggggcgca 

gcaggctggt 

tgctggcgtt 

gcgcggtttt 

ctaggctcat 

ggat cccccc 

tccacccaga 

tggtgctcca 

gtgcatgcat 

agttggccgc 

cccacgcggg 

agaccaaggt 

tgcccgtctc 

ggcaggggtg 

ttggctgcat 

aagtggtctc 

ctgtctatca 

tgtacaccaa 

tgacaccatg 

ttccggtgcg 

acttgaaggg 

ttcgggctgc 

ctatggaaac 

cgcagacatt 

tgccggctgc 

ccaccctagg 

ccggggtaag 

ttgccgacgg 

caactgactc 

gagcgcgact 

caaacatcga 

tccccatcga 

gtgatgagct 

gccttgatgt 

taatgacggg 

agacagtcga 

acgcggtgtc 

ggtttgtgac 

gctatgacgc 

gggcttacct 

gcgtctttac 

gagacaactt 

cacctccatc 

ggccaacgcc 

ccataaccaa 



gatgaactgg 

tgtcgtggat 

catggtgggg 

gggaacctat 

ttcacccggg 

caggactgcc 

cgtgcacaag 

cgcgttcgct 

gccttattgt 

tggtccagtg 

cgt ccctacg 

gccgccgcaa 

gtgcgggggc 

ggactgcttc 

gacacccaga 

cttt accatc 

atgcaattgg 

cccgctgctg 

ggctctgtcc 

cggtataggg 

ccttcttctg 

agctgaggcc 

tggcattctc 

ccctggggcg 

accaccacga 

cgtaggtctg 

atggtggtt a 

cctcaacgtt 

gctaatcttt 

ggctggtata 

gctggtgcgg 

actgacaggt 

cctacgagac 

tatcacctgg 

cgcccgcagg 

gcgactcctc 

cat cactagc 

caccgcaaca 

tggtgccggc 

tgtggaccag 

cacctgcggc 

ccggcggggc 

ctcttcgggc 

cgtgtgcacc 

cactatgcgg 

ccaggtggcc 

gtatgcagccr 

tttcggggcg 

gaccatcacc 

tggttgctct 

gaccactatc 

cgtcgtgctc 

ggaggtggct 

gaccatcaag 

cgccgcgaag 

atccgtcata 

ctttaccggc 

cttcagcctg 

acgctcgcag 

tccaggagaa 

gggctgtgct 

aaacacacca 

aggcct cacc 

cccctacctg 

gtgggaccaa 

cctgctgtat 

atacatcatg 



t cacctacag 
atggtggcgg 
aactgggcta 
gtgacagggg 
tcatcccaga 
Gtgaactgca 
ttcaactcat 
caggggtggg 
tggcactacg 
t actgct tea 
t acagttggg 
ggcaactggt 
cGcccgtgta 
cggaagcacc 
tgct t ggt CG 
tt caaggtta 
actcgaggag 
ctgtctacaa 
actggtttga 
tcggcggttg 
gcggacgcgc 
gccctagaga 
tccttcct eg 
gcatatgccc 
gcatacgcGa 
atactGttga 
caatattt ta 
cgggggggcc 
accat caeca 
accaaagtgc 
aaggttgctg 
acgtacgttt 
cttgcggtgg 
ggggcagaca 
gggagggaga 
gcgectatta 
ct cacaggce 
caatctttec 
t eaaagaeec 
gaectcgteg 
aget cggaec 
gacageaggg 
ggtccactgc 
cgaggggttg 
tccccggtct 
catctacacg 
^caagggtata 
tatatgtcta 
acgggtgccc 
gggggegcct 
ctgggcatcg 
gccacegeta 
etgtceagca 
ggggggaggc 
ctgtccggcc 
ccaactagcg 
gatttcgact 
gacccgacct 
cggcgaggca 
cggccctcgg 
tggtacgagc 
gggttgcccg 
cacat agacg 
gtagcatace 
atgtggaagt 
aggctgggag 
gcatgcatgt 



cagecctagt: 

gggcecat tg 

aggttctgat 

ggacgatggc 

aaatccagct 

atgactccct 

etggatgccc 

ggcccatcac 

caccccggcc 

ccccaagece 

gggagaatga 

ttggct gtac 

acatcggggg 

cegaggeeac 

actacccar a 

ggatgtacgt 

agcgttgtaa 

cggagtggca 

tccatctcca 

tot cctttgc 

gcgtctgtgc 

acctggtggt 

tgttcttet g 

t etacggcgt 

tggaceggga 

cettgtcacc 

tcaccagggc 

gegatgecgt 

aaatettgct 

egtactt cgt 

ggggtcatta 

atgaccatct 

cagttgagcc 

ccgcggcgtg 

tacatctggg 

cggcctactc 

gggacaggaa 

tggcgacctg 

ttgccggccc 

gctggcaagc 

tttacttggt 

ggagcctact 

tctgcccctc 

cgaaggcggt 

tcacggacaa 

cccetactgg 

aggtgcttgt 

aggcacatgg 

ccatcacgta 

atgacatcat 

gcacagtcct 

cgcctccggg 

ctggagaaat 

acctcattt t 

tcggactcaa 

gagacgtcat 

cagtgatcga 

tcaccattga 

ggactggtag 

gcatgttcga 

tcacgeecge 

tetgccagga 

cecat tt ctt 

aggctaeggt 

gtctcatacg 

ccgttcaaaa 

cggctgacct 



ggtatcgcag 

gggagtccta 

tgtgatgcta 

caaaaacacc 

tgtaaacacc 

caacactggg 

agagcgcatg 

ttacaatgag 

gtgcggt at c 

tgtcgtggtg 

gacggacgtg 

atggatgaat 

gatcggcaat 

ttacaccaag 

caggctttgg 

ggggggagtg 

cctggaggac 

ggtattgccc 

tcagaacgtc 

aatcaaatgg 

ctgcttgtgg 

cct caacgcg 

tgctgcctgg 

atggccgeta 

gatggcagca 

gcact ataag 

cgaggeaeac 

catcctcetc 

cgccatactc 

gegcgcacac 

tgtccaaatg 

caccccactg 

egtcgtcttc 

tggggacatc 

accggcagac 

ccaacagacg 

ccaggtcgag 

cgt caatggc 

aaagggccca 

gccccccggg 

eacgaggcat 

ctcccccagg 

ggggcacgct 

ggactttgta 

ctcgtcccct 

tagcggcaag 

cctgaacccg 

tatcgaccct 

ctccacctat 

aatatgtgat 

ggaccaagcg 

atcggtcacc 

eeccttttat 

ctgccattcc 

tgctgtagca 

tgtcgtagca 

ctgcaataca 

gacgacgacc 

gggcaggatg 

ttcctcggtt 

cgagacctca 

ccatctggag 

gtcccagact 

gtgcgccagg 

gctaaagcct 

cgaggttact 

ggaggtcgtc 



2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 
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acgagcacct 
acaggcagcg 
gacagggaag 
tacatcgaac 
caaacagcca 
ctcgaagcct 
ggcttgtcca 
atcaccagcc 
gccgcccaac 
gcggctgt tg 
gcaggggtgg 
gaggacctgg 
gtgtgcgcag 
aaccggctga 
gagagcgacg 
ctgaagaggc 
ctaagagatg 
tccaagctcc 
ggagtctggc 
ggacatgtga 
catggaacat 
aattat tcta 
ggggatttcc 
ccggcccccg 
tgcaaacccc 
gggtcacagc 
gacccctccc 
tccttggcca 
acccgtcatg 
atgggcggga 
gagccgctcc 
aggtccagga 
ctgt t agagt 
ccgcct gcca 
gaatct accg 
tcgtcggccg 
gacgcgggat 
gatcccgatc 
gtctgctgct 
gaaaccaagc 
tatgctacaa 
caggtcctgg 
gttaaggcta 
agatctaaat 
cacatccgct 
atcatggcaa 
cgccttatcg 
gtggtctcca 
ggacagcggg 
gcatatgaca 
tcaatctacc 
gagcggcttt 
cggtgccgcg 
aaggccgctg 
gacgaccttg 
gccttcacgg 
tacgacttgg 
ggcaaaaggg 
gagacagcta 
accttgtggg 
caacttgaaa 
gacctacctc 
tctccaggtg 
cgagtctgga 
gctgccactt 
ccaatcccgg 
ggagacatat 



gggtgctggt 
tggt cattgt 
tcctttaccg 
agggaatgca 
ccaagcaagc 
tctgggcgaa 
ctctgcctgg 
cgctcaccac 
ttgct cct cc 
gcagcatagg 
caggcgcgct 
ttaacctact 
cgat actgcg 
tagcgttcgc 
ctgcagcacg 
ttcaccagtg 
tttgggattg 
tgccgcgatt 
ggggcgacgg 
aaaacggttc 
t ccccattaa 
gggcgctgtg 
actacgtgac 
aatt cttcac 
tcctacggga 
tcccatgcga 
acattacggc 
gctcatcagc 
actccccgga 
acatcacccg 
aagcggagga 
aatt ccct eg 
cctggaagga 
aggcccct cc 
t gtcttctgc 
tcgacagcgg 
ccgacgttga 
tcagcgacgg 
cgatgtccta 
tgcccatcaa 
catctcgcag 
acgaccacta 
aacttctatc 
ttggctatgg 
ccgtgtggaa 
aaaatgaggt 
tatt cccaga 
ccctccctca 
tcgagttcct 
cccgctgttt 
aatgttgtga 
acatcggggg 
cgagcggtgt 
cggcctgtcg 
tcgttatctg 
aggctatgac 
agttgataac 
tgtactat ct 
gacacactcc 
caaggatgat 
aagccctaga 
agatcatt ca 
agatcaatag 
gacatcgggc 
gtggcaagta 
ctgcgtccca 
atcacagcct 



aggcggagtc 

gggcaggatc 

ggagttcgat 

gctcgccgaa 

ggaggctgct 

gcatatgtgg 

caaccccgcg 

ccaacatacc 

cagcgctgct 

ccttgggaag 

cgtggccttt 

ccctgctatc 

tcggcacgtg 

t tcgcggggt 

tgtcactcag 

gatcaacgag 

gatatgcacg 

gccgggagtc 

catcatgcaa 

catgaggatc 

cgcgtacacc 

gcgggtggct 

gggcatgacc 

agaagtggat 

ggaggtcaca 

gcccgaaccg 

ggagacggct 

tagccagctg 

cgctgacctc 

cgtggagtca 

ggatgagagg 

agcgatgccc 

cccggactac 

gat accacct 

cttggcggag 

cacggcaacg 

gt cgtactcc 

gtcttggtct 

cacatggaca 

tgcactgagc 

cgcaagcctg 

ccgggacgtg 

cgtggaggaa 

ggcaaaggac 

ggacttgctg 

ttt ctgcgtc 

tttgggggtf" 

ggccgtgatg 

ggtgaatgcc 

tgactcaacg 

cttggccccc 

ccccctgact 

actgacgacc 

agctgcgaag 

tgaaagcgcg 

tagatactct 

at catgctcc 

cacccgtgac 

agtcaattcc 

cctgatgact 

ttgtcagatc 

acgactccat 

ggtggcttca 

cagaagtgtc 

cctcttcaac 

gttggattta 

gtctcgtgcc 



ctagcagctc 
atcttgtccg 
gagatggaag 
caattcaaac 
gctcccgtgg 
aatttcatca 
atagcatcac 
ctcct gttta 
t ctgct tt eg 
gtgcttgtgg 
aaggtcatga 
ctctcccctg 
ggcccagggg 
aaccacgtct 
atcctctcta 
gactgctcca 
gtgttgactg 
cccttcttct 
accacctgcc 
gtggggccta 
acgggcccct 
gctgaggagt 
actgacaacg 
ggggtgcggt 
ttcctggtcg 
gacgtagcag 
aagcgtaggc 
tctgcgcctt 
atcgaggcca 
gaaaataagg 
gaagtatccg 
atatgggcac 
gtccctccag 
ccacggagga 
ctcgccacaa 
gcctctcctg 
tccatgcccc 
accgtaagcg 
ggcgccctga 
aactctttgc 
cggcagaaga 
ctcaaggaga 
gcctgtaagc 
gtccggaacc 
gaagacactg 
caaccagaga 
cgtgtgtgcg 
ggctcttcat 
tggaaagcga 
gtcactgaga 
gaagccagac 
aattctaaag 
agctgcggta 
ctccaggact 
gggacccaag 
gccccccctg 
t ccaatgtgt 
cccaccaccc 
tggctaggca 
catttcttct 
tacggggcct 
ggccttagcg 
tgcctcagga 
cgcgctaggc 
tgggcagtaa 
tccagctggt 
cgaccccgct 



tggccgcgta 

gaaagccggc 

agtgcgcctc 

agaaggcaat 

tggaatccaa 

gcgggataca 

tgatggcatt 

acatcctggg 

taggcgccgg 

atattttggc 

gcggcgagat 

gcgccctagt 

agggggctgt 

cccccacgca 

gtcttaccat 

cgccatgctc 

atttcaagac 

catgtcaacg 

catgtggagc 

ggacctgtag 

gcacgccctc 

acgtggaggt 

taaagtgccc 

tgcacaggta 

ggctcaatca 

tgctcacttc 

tggccagggg 

ccttgaaggc 

acctcctgtg 

tagtaatttt 

tt ccggcgga 

gcccggatta 

tggt acacgg 

agaggacggt 

agaccttcgg 

accagccctc 

cccttgaggg 

aggaggctag 

tcacgccatg 

tccgtcacca 

aggtcacctt 

tgaaggcgaa 

tgacgccccc 

tatccagcaa 

agacaccaat 

aggggggccg 

agaaaatggc 

acggattcca 

agaaatgccc 

atgacatccg 

aggccataag 

ggcagaactg 

ataccctcac 

gcacgatgct 

aggacgaggc 

gggacccgcc 

cagtcgcgca 

cccttgcgcg 

acatcatcat 

ccatccttct 

gttactccat 

cattttcact 

aacttggggt * 

tactgtccca 

ggaccaagct 

tcgttgctgg 

ggttcatgtg 



ttgcctgaca 
catcattccc 
acacctccct 
cgggttgctg 
gtggcggacc 
atatttagca 
cacagcctct 
gggatgggtg 
cat cgctgga 
aggtt atgga 
gccct ccacc 
cgtcggggtc 
gcagtggatg 
ctatgtgcct 
cact cagctg 
cggctcgtgg 
ctggctccag 
tgggtacaag 
acagatcacc 
taacacgtgg 
cccggcgcca 
tacgcgggtg 
gtgtcaggtt 
cgctccagcg 
atacctggtt 
catgctcacc 
atctcccccc 
aacatgcact 
gcggcaggag 
ggactctttc 
gatcctgcgg 
caaccct cca 
gtgtccattg 
tgtcctgt ca 
cagctccgaa 
cgacgacggc 
ggagccgggg 
tgaggacgtc 
cgctgcggag 
caacttggtc 
tgacagactg 
ggcgtccaca 
acattcggcc 
ggccgttaac 
tgacaccacc 
caagccagct 
cctttacgat 
atactctcct 
tatgggcttc 
tgttgaggag 
gtcgctcaca 
cggctatcgc 
atgttacttg 
cgtatgcgga 
gagcctacgg 
caaaccagaa 
cgatgcatct 
ggctgcgtgg 
gtatgcgccc 
agctcaggaa 
tgagccactt 
ccat agttac 
accgcccttg 
gggggggagg 
caaactcact 
ttacagcggg 
gtgcctactc 



6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 ' 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 
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ctactttctg taggggtagg catctatcta ctccccaacc gatgaacggg gagctaaaca 10860 

ctccaggcca ataggccatc ctgttttttt cccttttttt ttttcttttt tttttttttt 10920 

tttttttttt tttttttttc tccttttttt ttcctctttt tttccttttc tttcctttgg 10980 

tggctccatc ttagccctag tcacggctag ctgtgaaagg tccgtgagcc gcttgactgc 11040 

agagagtgct gatactggcc tctctgcaga tcaagt 11076 
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<210> 2 
<211> 8637 
<212> DNA 

<213> Hepatitis C Virus 



<400> 2 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 4 80 
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
Gcgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 
atggaccggg agatggcagc atcgtgcgga ggcgcggttt tcgtaggtct gatactcttg 1860 
accttgtcac cgcactataa gctgttcctc gctaggctca tatggtggtt acaatatttt 1920 
atcaccaggg ccgaggcaca cttgcaagtg tggatccccc ccctcaacgt tcgggggggc 1980 
cgcgatgccg tcatcctcct cacgtgcgcg atccacccag agctaatctt taccatcacc 2040 
aaaatcttgc tcgccatact cggtccactc atggtgctcc aggctggtat aaccaaagtg 2100 
ccgtacttcg tgcgcgcaca cgggctcatt cgtgcatgca tgctggtgcg gaaggttgct 2160 
gggggtcatt atgtccaaat ggctctcatg aagttggccg cactgacagg tacgtacgtt 2220 
tatgaccatc tcaccccact gcgggactgg gcccacgcgg gcctacgaga ccttgcggtg 2280 
gcagttgagc ccgtcgtctt ctctgatatg gagaccaagg ttatcacctg gggggcagac 2340 
accgcggcgt gtggggacat cat cttgggc "ctgcccgt ct ccgcccgcag ggggagggag 2400 
atacatctgg gaccggcaga cagccttgaa gggcaggggt ggcgactcct cgcgcctatt 24 60 
acggcctact cccaacagac gcgaggccta cttggctgca tcatcactag cctcacaggc 2520 
cgggacagga accaggtcga gggggaggtc caagtggtct ccaccgcaac acaatctttc 2580 
ctggcgacct gcgtcaatgg cgtgtgttgg actgtctatc atggtgccgg ctcaaagacc 2640 
cttgccggcc caaagggccc aatcacccaa atgtacacca atgtggacca ggacctcgtc 2700 
ggctggcaag cgccccccgg ggcgcgttcc ttgacaccat gcacctgcgg cagctcggac 2760 
ctttacttgg tcacgaggca tgccgatgtc attccggtgc gccggcgggg cgacagcagg 2820 
gggagcctac tctcccccag gcccgtctcc tacttgaagg gctcttcggg cggtccactg 2880 
ctctgcccct cggggcacgc tgtgggcatc tttcgggctg ccgtgtgcac ccgaggggtt 2940 
gcgaaggcgg tggactttgt acccgtcgag tctatggaaa ccactatgcg gtccccggtc 3000 
ttcacggaca actcgtcccc tccggccgta ccgcagacat tccaggtggc ccatctacac 3060 
gcccctactg gtagcggcaa gagcactaag gtgccggctg cgtatgcagc ccaagggtat 3120 
aaggtgcttg tcctgaaccc gtccgtcgcc gccaccctag gtttcggggc gtatatgtct 3180 
aaggcacatg gtatcgaccc taacatcaga accggggtaa ggaccatcac cacgggtgcc 3240 
cccatcacgt actccaccta tggcaagttt cttgccgacg gtggttgctc tgggggcgcc 3300 
tatgacatca taatatgtga tgagtgccac tcaactgact cgaccactat cctgggcatc 3360 
ggcacagtcc tggaccaagc ggagacggct ggagcgcgac tcgtcgtgct cgccaccgct 3420 
acgcctccgg gatcggtcac cgtgccacat ccaaacatcg aggaggtggc tctgtccagc 34 80 
actggagaaa tcccctttta tggcaaagcc atccccatcg agaccatcaa gggggggagg 3540 
cacctcattt tctgccattc caagaagaaa tgtgatgagc tcgccgcgaa gctgtccggc 3600 
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ctcggactca 
ggagacgtca 
tcagtgatcg 
ttcaccattg 
aggactggta 
ggcatgt teg 
ctcacgcccg 
gtctgccagg 
gcccatttct 
caggct acgg 
tgtct catac 
gccgttcaaa 
tcggctgacc 
ctggccgcgt 
ggaaagccgg 
gagtgcgcct 
cagaaggcaa 
gtggaatcca 
agcgggatac 
ctgatggcat 
aacat cctgg 
gtaggcgccg 
gatat tttgg 
agcggcgaga 
ggcgccct ag 
gagggggctg 
tcccccacgc 
agtcttacca 
acgccatgct 
gatttcaaga 
tcatgtcaac 
ccatgtggag 
aggacctgt a 
tgcacgccct 
tacgtggagg 
gtaaagtgcc 
ttgcacaggt 
gggctcaatc 
gtgctcactt 
ctggccaggg 
tccttgaagg 
aacctcctgt 
gtagtaattt 
gttccggcgg 
cgcccggatt 
gtggtacacg 
aagaggacgg 
aagacctt eg 
gaccagccct 
ccccttgagg 
gaggaggcta 
atcacgccat 
ctccgtcacc 
aaggtcacct 
atgaaggcga 
ctgacgcccc 
ctat ccagca 
gagacaccaa 
aaggggggcc 
gagaaaatgg 
tacggattcc 
aagaaatgcc 
aatgacatcc 
caggccataa 
gggcagaact 
aataccctca 
tgcacgatgc 



atgctgtagc 
ttgtcgtagc 
actgcaatac 
agacgacgac 
ggggcaggat 
attcctcggt 
ccgagacctc 
accat ctgga 
t gtcccagac 
tgtgcgccag 
ggctaaagcc 
acgaggt tac 
tggaggt cgt 
attgcctgac 
ccatcattcc 
cacacctccc 
tcgggttgct 
agtggcggac 
aatatttagc 
tcacagcctc 
ggggatgggt 
gcatcgctgg 
caggttat gg 
tgccct ccac 
tcgtcggggt 
tgcagtggat 
actatgtgcc 
tcactcagct 
ccggctcgtg 
cctggctcca 
gtgggtacaa 
cacagatcac 
gtaacacgtg 
ccccggcgcc 
ttacgcgggt 
cgtgtcaggt 
acgctccagc 
aatacctggt 
ccatgctcac 
gatctccccc 
caacatgcac 
ggcggcagga 
tggactcttt 
agatcctgcg 
acaaccctcc 
ggtgtccatt 
t tgtcctgtc 
gcagctccga 
ccgacgacgg 
gggagccggg 
gtgaggacgt 
gcgctgcgga 
acaacttggt 
ttgacagact 
aggcgtccac 
cacattcggc 
aggccgttaa 
ttgacaccac 
gcaagccagc 
ccctttacga 
aat act ctcc 
ctatgggctt 
gtgttgagga 
ggtcgctcac 
gcggctatcg 
catgttactt 
tcgtatgcgg 



atattaccgg 
aacggacgct 
atgtgtcacc 
cgtgccacaa 
gggcatttac 
tctgtgcgag 
agt taggt tg 
gttctgggag 
taagcaggca 
ggctcaggct 
tacgctgcac 
taccacacac 
cacgagcacc 
aacaggcagc 
cgacagggaa 
ttacatcgaa 
gcaaacagcc 
cctcgaagcc 
aggcttgtcc 
tatcaccagc 
ggccgcccaa 
agcggctgtt 
agcaggggtg 
cgaggacctg 
cgtgtgcgca 
gaaccggctg 
tgagagcgac 
gctgaagagg 
gctaagagat 
gtccaagctc 
gggagtctgg 
cggacatgtg 
gcat ggaaca 
aaattat t ct 
gggggatttc 
tccggccccc 
gtgcaaaccc 
tgggtcacag 
cgacccctcc 
ctccttggcc 
tacccgtcat 
gatgggcggg 
cgagccgctc 
gaggtccagg 
actgttagag 
gccgcctgcc 
agaat ctacc 
atcgt cggcc 
cgacgcggga 
ggatcccgat 
cgtctgctgc 
ggaaaccaag 
ctatgctaca 
gcaggtcctg 
agttaaggct 
cagatctaaa 
ccacatccgc 
catcatggca 
tcgccttatc 
tgtggtctcc 
tggacagcgg 
cgcatatgac 
gtcaatct ac 
agagcggctt 
ccggtgccgc 
gaaggccgct 
agacgacctt 



ggcct tgatg 
ctaatgacgg 
cagacagtcg 
gacgcggtgt 
aggtttgtga 
tgctatgacg 
cgggcttacc 
agcgtcttta 
ggagacaact 
ccacct ccat 
gggccaacgc 
cccataacca 
tgggtgctgg 
gtggtcattg 
gtcctttacc 
cagggaatgc 
accaagcaag 
ttctgggcga 
actctgcctg 
ccgctcacca 
cttgctcctc 
ggcagcatag 
gcaggcgcgc 
gttaacctac 
gcgatactgc 
atagcgttcg 
gctgcagcac 
cttcaccagt 
gtttgggatt 
ctgccgcgat 
cggggcgacg 
aaaaacggtt 
tt ccccatta 
agggcgctgt 
cactacgtga 
gaattcttca 
ct cctacggg 
ctcccatgcg 
cacattacgg 
agctcatcag 
gactccccgg 
aacatcaccc 
caagcggagg 
aaattccctc 
tcctggaagg 
aaggcccctc 
■'gtgtcttctg 
gtcgacagcg 
t ccgacgttg 
ctcagcgacg 
tcgatgtcct 
ctgcccatca 
acatctcgca 
gacgaccact 
aaacttctat 
tttggctatg 
tccgtgtgga 
aaaaatgagg 
gtattcccag 
accctccctc 
gtcgagttcc 
acccgctgtt 
caatgttgtg 
tacat cgggg 
gcgagcggtg 
gcggcctgtc 
gtcgttatct 



tatccgtcat 
gctttaccgg 
acttcagcct 
cacgctcgca 
ctccaggaga 
cgggctgtgc 
taaacacacc 
caggcctcac 
tcccctacct 
cgtgggacca 
ccct gctgt a 
aatacatcat 
taggcggagt 
tgggcaggat 
gggagttcga 
agctcgccga 
cggaggctgc 
agcatatgtg 
gcaaccccgc 
cccaacatac 
ccagcgctgc 
gccttgggaa 
tcgtggcctt 
tccctgctat 
gtcggcacgt 
cttcgcgggg 
gtgtcactca 
ggatcaacga 
ggatatgcac 
tgccgggagt 
gcat catgca 
ccatgaggat 
acgcgtacac 
ggcgggtggc 
cgggcatgac 
cagaagtgga 
aggaggtcac 
agcccgaacc 
cggagacggc 
ctagccagct 
acgctgacct 
gcgtggagtc 
aggatgagag 
gagcgatgcc 
acccggacta 
cgataccacc 
ccttggcgga 
gcacggcaac 
agtcgtactc 
ggtcttggtc 
acacatggac 
atgcactgag 
gcgcaagcct 
accgggacgt 
ccgtggagga 
gggcaaagga 
aggacttgct 
ttttctgcgt 
atttgggggt 
aggccgtgat 
tggtgaatgc 
t tgactcaac 
acttggcccc 
gccccctgac 
tactgacgac 
gagctgcgaa 
gtgaaagcgc 



accaactagc 
cgatttcgac 
ggacccgacc 
gcggcgaggc 
acggccctcg 
ttggtacgag 
agggttgccc 
ccacatagac 
ggtagcatac 
aatgtggaag 
t aggctggga 
ggcatgcatg 
cctagcagct 
catcttgtcc 
tgagatggaa 
acaattcaaa 
tgctcccgtg 
gaatttcatc 
gatagcatca 
cctcctgttt 
ttctgctttc 
ggtgcttgtg 
taaggtcatg 
cctctcccct 
gggcccaggg 
taaccacgtc 
gat cctctct 
ggactgctcc 
ggtgttgact 
ccccttcttc 
aaccacctgc 
cgtggggcct 
cacgggcccc 
tgctgaggag 
cactgacaac 
tggggtgcgg 
attcctggtc 
ggacgtagca 
taagcgtagg 
gtctgcgcct 
catcgaggcc 
agaaaataag 
ggaagtatcc 
catatgggca 
cgtccctcca 
tccacggagg 
gctcgccaca 
ggcct ctcct 
ct ccatgccc 
taccgtaagc 
aggcgccctg 
caactctttg 
gcggcagaag 
gctcaaggag 
agcctgtaag 
cgtccggaac 
ggaagacact 
ccaaccagag 
tcgtgtgtgc 
gggctcttca 
ctggaaagcg 
ggtcactgag 
cgaagccaga 
taattctaaa 
cagctgcggt 
gctccaggac 
ggggacccaa 



3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
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gaggacgagg 
ggggacccgc 
tcagtcgcgc 
ccccttgcgc 
aacatcatca 
tccatccttc 
tgttactcca 
gcattttcac 
aaacttgggg 
ctactgtccc 
aggaccaagc 
ttcgttgctg 
tggttcatgt 
cgatgaacgg 
tttttctttt 
ttttcctttt 
gtccgtgagc 



cgagcctacg 
ccaaaccaga 
acgatgcatc 
gggctgcgtg 
tgtatgcgcc 
tagctcagga 
ttgagccact 
tccatagtta 
taccgccctt 
agggggggag 
tcaaact cac 
gttacagcgg 
ggtgcctact 
ggagctaaac 
tttttttttt 
ctttcctttg 
cgcttgactg 



ggccttcacg 
atacgacttg 
tggcaaaagg 
ggagacagct 
caccttgtgg 
acaacttgaa 
tgacctacct 
ctctccaggt 
gcgagtctgg 
ggctgccact 
t ccaat cccg 
gggagacata 
cctact ttct 
act ccaggcc 
tttttttttt 
gtggctccat 
cagagagtgc 



gaggctatga 
gagttgataa 
gtgtactatc 
agacacactc 
gcaaggatga 
aaagccctag 
cagatcattc 
gagatcaata 
agacat cggg 
tgtggcaagt 
gctgcgt ccc 
tatcacagcc 
gtaggggtag 
aataggccat 
tttttttttt 
cttagcccta 
tgatactggc 



ctagatactc 
catcatgctc 
tcacccgtga 
cagtcaattc 
tcctgatgac 
attgtcagat 
aacgactcca 
gggtggcttc 
ccagaagtgt 
acctcttcaa 
agttggattt 
tgt ctcgtgc 
gcatctat ct 
cctgtttttt 
ctcctttttt 
gtcacggcta 
ctctctgcag 



tgccccccct 
ctccaatgtg 
ccccaccacc 
ctggctaggc 
tcatttcttc 
ctacggggcc 
tggccttagc 
atgcctcagg 
ccgcgctagg 
ctgggcagta 
atccagctgg 
ccgaccccgc 
act ccccaac 
tccctttttt 
tttcctcttt 
gctgtgaaag 
atcaagt 



7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8637 
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<210> 3 
<211> 8001 
<212> DNA 

<213> Hepatitis C Virus 
<400> 3 

gccagccccc gattgggggc gacactccac caragatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 480 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggotg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900 
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440 
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560 
taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 1620 
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatggcgcc tattacggcc tactcccaac agacgcgagg cctacttggc 1860 
tgcatcatca ctagcctcac aggccgggac aggaaccagg tcgaggggga ggtccaagtg 1920 
gtctccaccg caacacaatc tttcctggcg acctgcgtca atggcgtgtg ttggactgtc 1980 
tatcatggtg ccggctcaaa gacccttgcc ggcccaaagg gcccaatcac ccaaatgtac 2040 
accaatgtgg accaggacct cgtcggctgg caagcgcccc ccggggcgcg ttccttgaca 2100 
ccatgcacct gcggcagctc ggacctttac ttggtcacga ggcatgccga tgtcattccg 2160 
gtgcgccggc ggggcgacag cagggggagc ctactctccc ccaggcccgt ctcctacttg 2220 
aagggctctt cgggcggtcc actgctctgc ccctcggggc acgctgtggg catctttcgg 2280 
gctgccgtgt gcacccgagg ggt tgcgaag ' gcggtggact ttgtacccgt cgagtctatg 2340 
gaaaccacta tgcggtcccc ggtcttcacg gacaactcgt cccctccggc cgtaccgcag 2400 
acattccagg tggcccatct acacgcccct actggtagcg gcaagagcac taaggtgccg 2460 
gctgcgtatg cagcccaagg gtataaggtg cttgtcctga acccgtccgt cgccgccacc 2520 
ctaggtttcg gggcgtatat gtctaaggca catggtatcg accctaacat cagaaccggg 2580 
gtaaggacca tcaccacggg tgcccccatc acgtactcca cctatggcaa gtttcttgcc 2640 
gacggtggtt gctctggggg cgcctatgac atcataatat gtgatgagtg ccactcaact 2700 
gactcgacca ctatcctggg catcggcaca gtcctggacc aagcggagac ggctggagcg 27 60 
cgactcgtcg tgctcgccac cgctacgcct ccgggatcgg tcaccgtgcc acatccaaac 2820 
atcgaggagg tggctctgtc cagcactgga gaaatcccct tttatggcaa agccatcccc 2880 
atcgagacca tcaagggggg gaggcacctc attttctgcc attccaagaa gaaatgtgat 2940 
gagctcgccg cgaagctgtc cggcctcgga ctcaatgctg tagcatatta ccggggcctt 3000 
gatgtatccg tcataccaac tagcggagac gtcattgtcg tagcaacgga cgctctaatg 3060 
acgggcttta ccggcgattt cgactcagtg atcgactgca atacatgtgt cacccagaca 3120 
gtcgacttca gcctggaccc gaccttcacc attgagacga cgaccgtgcc acaagacgcg 3180 
gtgtcacgct cgcagcggcg aggcaggact ggtaggggca ggatgggcat ttacaggttt 3240 
gtgactccag gagaacggcc ctcgggcatg ttcgattcct cggttctgtg cgagtgctat 3300 
gacgcgggct gtgcttggta cgagctcacg cccgccgaga cctcagttag gttgcgggct 3360 
tacctaaaca caccagggtt gcccgtctgc caggaccatc tggagttctg ggagagcgtc 3420 
tttacaggcc tcacccacat agacgcccat ttcttgtccc agactaagca ggcaggagac 3480 
aacttcccct acctggtagc ataccaggct acggtgtgcg ccagggctca ggctccacct 3540 
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ccatcgtggg 
acgcccctgc 
accaaataca 
ctggtaggcg 
attgtgggca 
taccgggagt 
atgcagct eg 
caagcggagg 
gcgaagcata 
cctggcaacc 
accacccaac 
cctcccagcg 
ataggccttg 
gcgctcgtgg 
ctactccctg 
ctgcgtcggc 
ttcgcttcgc 
gcacgtgtca 
cagtggatca 
gattggatat 
cgattgccgg 
gacggcatca 
ggttccatga 
attaacgcgt 
ctgtggcggg 
gtgacgggca 
ttcacagaag 
cgggaggagg 
tgcgagcccg 
acggcggaga 
tcagctagcc 
ccggacgctg 
acccgcgtgg 
gaggaggatg 
cctcgagcga 
aaggacccgg 
cct ccgatac 
tctgccttgg 
agcggcacgg 
gttgagtcgt 
gacgggtctt 
tcctacacat 
atcaatgcac 
cgcagcgcaa 
cactaccggg 
ctatccgtgg 
tatggggcaa 
tggaaggact 
gaggttttct 
ccagatttgg 
cctcaggccg 
ttcctggtga 
tgttttgact 
tgtgacttgg 
gggggccccc 
ggtgtactga 
tgtcgagctg 
atctgtgaaa 
atgactagat 
ataacatcat 
tatctcaccc 
actccagtca 
atgatcctga 
ctagattgtc 
attcaacgac 
aatagggtgg 
cgggccagaa 



accaaatgtg 
tgtataggct 
tcatggcatg 
gagtcctagc 
ggat cat ct t 
t cgatgagat 
ccgaacaat t 
ctgctgctcc 
tgtggaattt 
ccgcgatagc 
ataccctcct 
ctgcttctgc 
ggaaggt get 
cctttaaggt 
ctatcctctc 
acgtgggccc 
ggggtaacca 
ctcagatcct 
acgaggactg 
gcacggtgtt 
gagt cccctt 
tgcaaaccac 
ggatcgtggg 
acaccacggg 
tggctgctga 
tgaccactga 
tggatggggt 
tcacattcct 
aaccggacgt 
cggctaagcg 
agctgtctgc 
acctcatcga 
agtcagaaaa 
agagggaagt 
tgcccatat g 
actacgtccc 
cacct ccacg 
cggagctcgc 
caacggcct c 
actcctccat 
ggtctaccgt 
ggacaggcgc 
tgagcaactc 
gcctgcggca 
acgtgctcaa 
aggaagcctg 
aggacgt ccg 
tgctggaaga 
gcgtccaacc 
gggttcgtgt 
tgatgggctc 
atgcctggaa 
caacggtcac 
cccccgaagc 
tgactaattc 
cgaccagctg 
cgaagctcca 
gcgcggggac 
actctgcccc 
gctcct ccaa 
gtgaccccac 
attcctggct 
tgactcattt 
agatctacgg 
tccatggcct 
cttcatgcct 
gtgtccgcgc 



gaagtgtctc 

gggagccgtt 

catgtcggct 

agctctggcc 

gtccggaaag 

ggaagagtgc 

caaacagaag 

cgtggtggaa 

cat cagcggg 

atcactgatg 

gtttaacatc 

ttt cgtaggc 

tgtggatat t 

catgagcggc 

ccctggcgcc 

aggggagggg 

cgtctccccc 

ctctagtctt 

ctccacgcca 

gactgatttc 

cttct catgt 

ctgcccatgt 

gcctaggacc 

cccctgcacg 

ggagtacgt g 

caacgtaaag 

gcggttgcac 

ggtcgggctc 

agcagtgctc 

taggctggcc 

gccttccttg 

ggccaacctc 

taaggtagta 

atccgttccg 

ggcacgcccg 

tccagtggta 

gaggaagagg 

cacaaagacc 

tcctgaccag 

gccccccctt 

aagcgaggag 

cctgatcacg 

tttgctccgt 

gaagaaggtc 

ggagatgaag 

taagctgacg 

gaacctatcc' 

cactgagaca 

agagaagggg 

gtgcgagaaa 

ttcatacgga 

agcgaagaaa 

tgagaatgac 

cagacaggcc 

taaagggcag 

cggtaatacc 

ggactgcacg 

ccaagaggac 

ccctggggac 

tgtgtcagtc 

cacccccctt 

aggcaacat c 

ctt ctccatc 

ggcctgttac 

tagcgcattt 

caggaaactt 

taggctactg 



atacggctaa 
caaaacgagg 
gacctggagg 
gcgtattgcc 
ccggccatca 
gcctcacacc 
gcaatcgggt 
tccaagtggc 
atacaatatt 
gcattcacag 
ctggggggat 
gccggcatcg 
ttggcaggtt 
gagatgccct 
ctagtcgtcg 
gctgtgcagt 
acgcactatg 
accatcactc 
tgctccggct 
aagacctggc 
caacgtgggt 
ggagcacaga 
tgtagtaaca 
ccct ccccgg 
gaggttacgc 
tgcccgtgtc 
aggtacgctc 
aatcaatacc 
acttccatgc 
aggggatctc 
aaggcaacat 
ctgtggcggc 
attttggact 
gcggagatcc 
gattacaacc 
cacgggtgtc 
acggttgtcc 
tt cggcagct 
ccct ccgacg 
gagggggagc 
gctagtgagg 
ccatgcgctg 
caccacaact 
acctttgaca 
gcgaaggcgt 
cccccacatt 
'agcaaggccg 
ccaattgaca 
ggccgcaagc 
atggcccttt 
ttccaatact 
tgccctatgg 
atccgtgttg 
ataaggtcgc 
aactgcggct 
ctcacatgtt 
atgctcgtat 
gaggcgagcc 
ccgcccaaac 
gcgcacgatg 
gcgcgggctg 
atcatgtatg 
cttctagctc 
t ccattgagc 
tcactccata 
ggggtaccgc 
tcccaggggg 



agcctacgct 
ttactaccac 
tcgtcacgag 
tgacaacagg 
ttcccgacag 
tcccttacat 
t gctgcaaac 
ggaccctcga 
tagcaggctt 
cctctatcac 
gggtggccgc 
ctggagcggc 
atggagcagg 
ccaccgagga 
gggtcgtgtg 
ggatgaaccg 
tgcctgagag 
agctgctgaa 
cgtggctaag 
tccagtccaa 
acaagggagt 
tcaccggaca 
cgtggcatgg 
cgccaaatta 
gggtggggga 
aggttccggc 
cagcgtgcaa 
tggttgggtc 
tcaccgaccc 
ccccctcctt 
gcactacccg 
aggagatggg 
ct ttcgagcc 
tgcggaggt c 
ctccactgtt 
cattgccgcc 
tgtcagaat c 
ccgaatcgtc 
acggcgacgc 
cgggggatcc 
acgtcgtctg 
cggaggaaac 
tggtctatgc 
gactgcaggt 
ccacagttaa 
cggccagatc 
ttaaccacat 
ccaccatcat 
cagctcgcct 
acgatgtggt 
ctcctggaca 
gcttcgcata 
aggagtcaat 
tcacagagcg 
atcgccggtg 
acttgaaggc 
gcggagacga 
tacgggcctt 
cagaatacga 
catctggcaa 
cgtgggagac 
cgcccacct t 
aggaacaact 
cacttgacct 
gttactct CO 
ccttgcgagt 
ggagggctgc 



gcacgggcca 
acaccccata 
cacctgggtg 
cagcgtggtc 
ggaagtcctt 
cgaacaggga 
agccaccaag 
agccttctgg 
gtccactct g 
cagcccgct c 
ccaacttgct 
tgttggcagc 
ggtggcaggc 
cctggttaac 
cgcagcgata 
gctgatagcg 
cgacgctgca 
gaggcttcac 
agatgtttgg 
gctcctgccg 
ctggcggggc 
tgtgaaaaac 
aacattcccc 
ttctagggcg 
tttccactac 
ccccgaattc 
acccctccta 
acagctccca 
ctcccacatt 
ggccagctca 
tcatgactcc 
cgggaacatc 
gctccaagcg 
caggaaattc 
agagtcctgg 
tgccaaggcc 
taccgtgtct 
ggccgt cgac 
gggatccgac 
cgatctcagc 
ctgctcgatg 
caagctgccc 
tacaacatct 
cctggacgac 
ggctaaactt 
taaatttggc 
ccgctccgtg 
ggcaaaaaat 
tatcgtattc 
ctccaccctc 
gcgggtcgag 
tgacacccgc 
ctaccaatgt 
gctttacatc 
ccgcgcgagc 
cgctgcggcc 
ccttgtcgtt 
cacggaggct 
cttggagttg 
aagggtgtac 
agctagacac 
gtgggcaagg 
tgaaaaagcc 
acct cagatc 
aggtgagatc 
ctggagacat 
cacttgtggc 



3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
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aagtacctct tcaactgggc agtaaggacc aagctcaaac tcactccaat cccggctgcg 7620 

tcccagttgg atttatccag ctggttcgtt gctggttaca gcgggggaga catatatcac 7680 

agcctgtctc gtgcccgacc ccgctggttc atgtggtgcc tactcctact ttctgtaggg 7740 

gtaggcatct atctactccc caaccgatga acggggagct aaacactcca ggccaatagg 7800 

ccatcctgtt tttttccctt tttttttttc tttttttttt tttttttttt tttttttttt 7860 

ttttctcctt tttttttcct ctttttttcc ttttctttcc tttggtggct ccatcttagc 7920 

cctagtcacg gctagctgtg aaaggtccgt gagccgcttg actgcagaga gtgctgatac 7980 

tggcctctct gcagatcaag t 8001 
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<210> 4 
<211> 7989 
<212> DNA 

<213> Hepatitis C Virus 



<400> 4 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 

tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 

cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 

gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 

gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 

gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 

ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 

cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 

ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 

acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 

cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 

tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 

aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 

cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 

ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 

ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 

gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 

tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 

ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 

agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 

gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 

cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 

ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 

aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 

gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 

aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 

gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 

tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 

atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 

aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 

atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 

agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 1920 

acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980 

ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 

caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 2100 

ggcagctcgg acctttactt ggtcacgagg catgccgatg tcattccggt gcgccggcgg 2160 

ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 2220 

ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2280 

acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 2340 

cggtccccgg tcttcacgga caactcgtcc"cctccggccg taccgcagac attccaggtg 2400 

gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 2460 

gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 2520 

gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 2580 

accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 

tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 

atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 

ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 

gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 2880 

aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2 94 0 

aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 

ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 

ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 

ctggacccga ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 

cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 

gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 

gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 

ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 

acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 3480 

ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 

caaatgtgga agtgtctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 3600 
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tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660 
atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 
gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 
atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 3840 
gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 3900 
gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 
gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4020 
tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4080 
gcgatagcat cactgatggc attcacagcc tctatcacca gcccgctcac cacccaacat 4140 
accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4200 
gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4260 
aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4320 
tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 4380 
atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 4 440 
gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 4500 
ggtaaccacg tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 4560 
cagatcctct ctagtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 4 620 
gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc 4680 
acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 4740 
gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 4800 
caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 4860 
atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taacgcgtac 4920 
accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 4980 
gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5040 
accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5100 
gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5160 
acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5220 
ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5280 
gctaagcgta ggctggccag gggatctccc ccctccttgg ccagctcatc agctagccag 5340 
ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 5400 
ctcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 5460 
tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 5520 
agggaagtat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg 5580 
cccatatggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 5640 
tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca 5700 
cctccacgga ggaagaggac ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg 5760 
gagctcgcca caaagacctt cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca 5820 
acggcctctc ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac 5880 
tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 594 0 
tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg 6000 
acaggcgccc tgatcacgcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6060 
agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6120 
ctgcggcaga agaaggtcac ctttgacaga ctgcaggtcc tggacgacca ctaccgggac 6180 
gtgctcaagg agatgaaggc gaaggcgtcc acagttaagg ctaaacttct atccgtggag 6240 
gaagcctgta agctgacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6300 
gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 6360 
ctggaagaca ctgagacacc aattgacacc"accatcatgg caaaaaatga ggttttctgc 6420 
gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 6480 
gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggccgtg 6540 
atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctggtgaat 6600 
gcctggaaag cgaagaaatg ccctatgggc ttcgcatatg acacccgctg ttttgactca 6660 
acggtcactg agaatgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 6720 
cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 6780 
actaattcta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 68 4 0 
accagctgcg gtaataccct cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg 6900 
aagctccagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 6960 
gcggggaccc aagaggacga ggcgagccta cgggccttca cggaggctat gactagatac 7020 
tctgcccccc ctggggaccc gcccaaacca gaatacgact tggagttgat aacatcatgc 7080 
tcctccaatg tgtcagtcgc gcacgatgca tctggcaaaa gggtgtacta tctcacccgt 7140 
gaccccacca ccccccttgc gcgggctgcg tgggagacag ctagacacac tccagtcaat 7200 
tcctggctag gcaacatcat catgtatgcg cccaccttgt gggcaaggat gatcctgatg 7260 
actcatttct tctccatcct tctagctcag gaacaacttg aaaaagccct agattgtcag 7320 
atctacgggg cctgttactc cattgagcca cttgacctac ctcagatcat tcaacgactc 7380 
catggcctta gcgcattttc actccatagt tactctccag gtgagatcaa tagggtggct 7440 
tcatgcctca ggaaacttgg ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt 7500 
gtccgcgcta ggctactgtc ccaggggggg agggctgcca cttgtggcaa gtacctcttc 7560 
aactgggcag taaggaccaa gctcaaactc actccaatcc cggctgcgtc ccagttggat 7620 
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ttatccagct ggttcgttgc tggttacagc 

gcccgacccc gctggttcat gtggtgccta 

ctactcccca accgatgaac ggggagctaa 

tttccctttt tttttttctt tttttttttt 

tttttcctct ttttttcctt ttctttcctt 

tagctgtgaa aggtccgtga gccgcttgac 
agatcaagt 



gggggsigc^ca tatatcacag cctgtctcgt 7680 

ctcctacttt ctgtaggggt aggcatctat 7740 

acactccagg ccaataggcc atcctgtttt 7800 

tttttttttt tttttttttt ttctcctttt 7860 

tggtggctcc atcttagccc tagtcacggc 7920 

tgcagagagt gctgatactg gcctctctgc 7980 

7989 
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<210> 5 
<211> 8649 
<212> DNA 

<213> Hepatitis C Virus 



<400> 5 

gccagccccc 

tcttcacgca 

cccccctccc 

gacgaccggg 

gcgagactgc 

gtgcttgcga 

ctcaaagaaa 

acgcaggttc 

caatcggctg 

ttgtcaagac 

cgtggctggc 

gaagggactg 

ctcctgccga 

cggctacctg 

tggaagccgg 

ccgaactgtt 

atggcgatgc 

actgtggccg 

ttgctgaaga 

ctcccgattc 

cagaccacaa 

aacgttactg 

tccaccatat 

acgagcattc 

gtgaaggaag 

t gcaggcagc 

taagatacac 

gaaagagtca 

gtaccccatt 

tcgaggt taa 

cacgataata 

ctgatactct 

ttacaatatt 

gttcgggggg 

tttaccatca 

ataaccaaag 

cggaaggttg 

ggtacgtacg 

gaccttgcgg 

tggggggcag 

agggggaggg 

ct cgcgccta 

agcctcacag 

acacaatctt 

ggctcaaaga 

caggacctcg 

ggcagctcgg 

ggcgacagca 

ggcggtccac 

acccgagggg 

cggtccccgg 

gcccatctac 

gcccaagggt 

gcgtatatgt 

accacgggtg 

tctgggggcg 

atcctgggca 

ctcgccaccg 

gctctgtcca 

aaggggggga 



gattgggggc 
gaaagcgt ct 
gggagagcca 
tcctttcttg 
tagccgagta 
gtgccccggg 
aaccaaacgt 
tccggccgct 
ctctgatgcc 
cgacctgtcc 
cacgacgggc 
gctgctattg 
gaaagtatcc 
cccattcgac 
tcttgtcgat 
cgccaggct c 
ctgcttgccg 
gctgggtgtg 
gcttggcggc 
gcagcgcatc 
cggtttccct 
gccgaagccg 
tgccgtcttt 
ctaggggtct 
cagttcctct 
ggaacccccc 
ctgcaaaggc 
aatggctct c 
gtatgggatc 
aaaacgtct a 
ccatggaccg 
tgaccttgt c 
ttat caccag 
gccgcgatgc 
ccaaaatctt 
tgccgtactt 
ctgggggtca 
tttatgacca 
tggcagttga 
acaccgcggc 
agatacatct 
ttacggccta 
gccgggacag 
tcctggcgac 
cccttgccgg 
tcggctggca 
acctttactt 
gggggagcct 
tgctctgccc 
ttgcgaaggc 
tcttcacgga 
acgcccctac 
ataaggtgct 
ctaaggcaca 
cccccat cac 
cctatgacat 
tcggcacagt 
ctacgcctcc 
gcactggaga 
ggcacctcat 



gacact ccac 
agccatggcg 
tagtggt ctg 
gatcaacccg 
gtgttgggtc 
aggtctcgta 
aacaccaacg 
tgggtggaga 
gccgtgttcc 
ggtgccctga 
gttccttgcg 
ggcgaagtgc 
atcatggctg 
caccaagcga 
caggatgat c 
aaggcgcgca 
aatatcatgg 
gcggaccgct 
gaat gggctg 
gccttctatc 
ctagcgggat 
cttggaataa 
tggcaatgtg 
ttcccctctc 
ggaagcttct 
acctggcgac 
ggcacaaccc 
ct caagcgta 
tgatctgggg 
ggccccccga 
ggagatggca 
accgcactat 
ggccgaggca 
cgtcatcctc 
gctcgccata 
cgtgcgcgca 
ttatgtccaa 
tctcacccca 
gcccgtcgt c 
gtgtggggac 
gggaccggca 
ctcccaacag 
gaaccaggt c 
ctgcgtcaat 
cccaaagggc 
agcgcccccc 
ggtcacgagg 
actctccccc 
ctcggggcac 
ggtggacttt 
caactcgtcc 
tggtagcggc 
tgtcctgaac 
tggtatcgac 
gtactccacc 
cataatatgt 
cctggaccaa 
gggatcggtc 
aatccccttt 
tttctgccat 



catagat cac 
ttagtatgag 
cggaaccggt 
ctcaatgcct 
gcgaaaggcc 
gaccgtgcac 
ggcgcgccat 
ggctattcgg 
ggctgtcagc 
atgaactgca 
cagctgtgct 
cggggcagga 
atgcaatgcg 
aacatcgcat 
tggacgaaga 
tgcccgacgg 
tggaaaatgg 
at caggacat 
accgct tcct 
gccttcttga 
caattccgcc 
ggccggtgtg 
agggcccgga 
gccaaaggaa 
tgaagacaaa 
aggtgcctct 
cagtgccacg 
ttcaacaagg 
cctcggtgca 
accacgggga 
gcatcgtgcg 
aagctgttcc 
cactt gcaag 
ctcacgtgcg 
ctcggtccac 
cacgggctca 
atggctctca 
ctgcgggact 
ttctctgata 
atcatcttgg 
gacagccttg 
acgcgaggcc 
gagggggagg 
ggcgtgtgtt 
ccaatcaccc 
ggggcgcgtt 
catgccgatg 
aggcccgtct 
gctgtgggca 
gtacccgtcg 
cctccggccg 
aagagcacta 
ccgtccgtcg 
cctaacatca 
tatggcaagt 
gatgagtgcc 
gcggagacgg 
accgtgccac 
tatggcaaag 
tccaagaaga 



tcccctgtga 
tgtcgtgcag 
gagtacaccg 
ggagatttgg 
ttgtggtact 
catgagcacg 
gattgaacaa 
ctatgactgg 
gcaggggcgc 
ggacgaggca 
cgacgttgtc 
tctcctgtca 
gcggctgcat 
cgagcgagca 
gcatcagggg 
cgaggatctc 
ccgcttttct 
agcgttggct 
cgtgctttac 
cgagttctt c 
cctctccctc 
cgtttgtcta 
aacctggccc 
tgcaaggtct 
caacgtctgt 
gcggccaaaa 
ttgtgagttg 
ggctgaagga 
catgctttac 
cgtggttttc 
gaggcgcggt 
tcgctaggct 
tgtggatccc 
cgatccaccc 
tcatggtgct 
ttcgtgcatg 
tgaagttggc 
gggcccacgc 
t ggagaccaa 
gcctgcccgt 
aagggcaggg 
tacttggctg 
tccaagtggt 
ggactgtcta 
aaatgtacac 
ccttgacacc 
tcattccggt 
cctacttgaa 
tctttcgggc 
agtctatgga 
taccgcagac 
aggtgccggc 
ccgccaccct 
gaaccggggt 
ttcttgccga 
actcaactga 
ctggagcgcg 
atccaaacat 
ccatccccat 
aatgtgatga 



ggaactactg 
cct ccaggac 
gaat t gccag 
gcgtgccccc 
gcctgatagg 
aatcct aaac 
gatggattgc 
gcacaacaga 
ccggttcttt 
gcgcggctat 
actgaagcgg 
tctcaccttg 
acgcttgatc 
cgtactcgga 
ctcgcgccag 
gt cgtgaccc 
ggattcatcg 
acccgtgata 
ggtatcgccg 
tgagtttaaa 
ccccccccct 
tatgttattt 
tgtcttcttg 
gttgaatgtc 
agcgaccctt 
gccacgtgta 
gatagttgtg 
tgcccagaag 
atgtgtttag 
ctttgaaaaa 
tttcgtaggt 
catatggtgg 
ccccctcaac 
agagctaatc 
ccaggctggt 
catgctggtg 
cgcactgaca 
gggcctacga 
ggttatcacc 
ctccgcccgc 
gtggcgactc 
catcatcact 
ctccaccgca 
tcatggtgcc 
caatgtggac 
atgcacctgc 
gcgccggcgg 
gggctcttcg 
tgccgtgtgc 
aaccactatg 
attccaggtg 
tgcgtatgca 
aggtttcggg 
aaggaccatc 
cggtggttgc 
ct cgaccact 
actcgtcgtg 
cgaggaggtg 
cgagaccatc 
gctcgccgcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 
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aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3660 
ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3720 
ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3780 
ctggacccga ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3840 
cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3900 
gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3960 
gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 4020 
ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 4080 
acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 4140 
ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 4200 
caaatgtgga agtgtctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 4260 
tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 4320 
atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 4380 
gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 4440 
atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 4500 
gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 4560 
gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 4 620 
gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4 680 
tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4740 
gcgatagcat cactgatggc attcacagcc tctatcacca gcccgctcac cacccaacat 4800 
accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4860 
gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4920 
aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4980 
tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 5040 
atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 5100 
gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 5160 
ggtaaccacg tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 5220 
cagatcctct ctagtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 5280 
gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc 5340 
acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 5400 
gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 5460 
caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 5520 
atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taacgcgtac 5580 
accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 5640 
gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5700 
accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5760 
gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5820 
acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5880 
ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5940 
gctaagcgta ggctggccag gggatctccc ccctccttgg ccagctcatc agctagccag 6000 
ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 6060 
ctcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 6120 
tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 6180 
agggaagtat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg 6240 
cccatatggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 6300 
tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca 6360 
cctccacgga ggaagaggac ggttgt cctg"tcagaatcta ccgtgtcttc tgccttggcg 6420 
gagctcgcca caaagacctt cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca 6480 
acggcctctc ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac 6540 
tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 6600 
tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg 6660 
acaggcgccc tgatcacgcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6720 
agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6780 
ctgcggcaga agaaggtcac ctttgacaga ctgcaggtcc tggacgacca ctaccgggac 6840 
gtgctcaagg agatgaaggc gaaggcgtcc acagttaagg ctaaacttct atccgtggag 6900 
gaagcctgta agctgacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6960 
gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 7020 
ctggaagaca ctgagacacc aattgacacc accatcatgg caaaaaatga ggttttctgc 7080 
gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 7140 
gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggccgtg 7200 
atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctggtgaat 7260 
gcctggaaag cgaagaaatg ccctatgggc ttcgcatatg acacccgctg ttttgactca 7320 
acggtcactg agaatgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 7380 
cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 7440 
actaattcta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 7500 
accagctgcg gtaataccct cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg 7560 
aagctccagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 7620 
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gcggggaccc aagaggacga ggcgagccta 
tctgcccccc ctggggaccc gcccaaacca 
tcctccaatg tgtcagtcgc gcacgatgca 
gaccccacca ccccccttgc gcgggctgcg 
tcctggctag gcaacatcat catgtatgcg 
actcatttct tctccatcct tctagctcag 
atctacgggg cctgttactc cattgagcca 
catggcctta gcgcattttc actccatagt 
tcatgcctca ggaaacttgg ggtaccgccc 
gtccgcgcta ggctactgtc ccaggggggg 
aactgggcag taaggaccaa gctcaaactc 
ttatccagct ggttcgttgc tggttacagc 
gcccgacccc gctggttcat gtggtgccta 
ctactcccca accgatgaac ggggagctaa 
tttccctttt tttttttctt tttttttttt 
tttttcctct ttttttcctt ttctttcctt 
tagctgtgaa aggtccgtga gccgcttgac 
agatcaagt 



cgggccttca cggaggctat gactagatac 7680 
gaatacgact tggagttgat aacatcatgc 7740 
tctggcaaaa gggtgtacta tctcacccgt 7800 
tgggagacag ctagacacac tccagtcaat 7860 
cccaccttgt gggcaaggat gatcctgatg 7920 
gaacaacttg aaaaagccct agattgtcag 7980 
cttgacctac ctcagatcat tcaacgactc 8040 
tactctccag gtgagatcaa tagggtggct 8100 
ttgcgagtct ggagacatcg ggccagaagt 8160 
agggctgcca cttgtggcaa gtacctcttc 8220 
actccaatcc cggctgcgtc ccagttggat 8280 
gggggagaca tatatcacag cctgtctcgt 8340 ^ 
ctcctacttt ctgtaggggt aggcatctat 8400 
acactccagg ccaataggcc atcctgtttt 8460 
tttttttttt tttttttttt ttctcctttt 8520 
tggtggctcc atcttagccc tagtcacggc 8580 
tgcagagagt gctgatactg gcctctctgc 8640 

8649 
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<210> 6 
<211> 8001 
<212> DNA 

<213> Hepatitis C Virus 

<400> 6 

gccagccccc gattgggggc gacactccac 
tcttcacgca gaaagcgtct agccatggcg 
cccccctccc gggagagcca tagtggtctg 
gacgaccggg tcctttcttg gatcaacccg 
gcgagactgc tagccgagta gtgttgggtc 
gtgcttgcga gtgccccggg aggtctcgta 
ctcaaagaaa aaccaaacgt aacaccaacg 
acgcaggttc tccggccgct tgggtggaga 
caatcggctg ctctgatgcc gccgtgttcc 
ttgtcaagac cgacctgtcc ggtgccctga 
cgtggctggc cacgacgggc gttccttgcg 
gaagggactg gctgctattg ggcgaagtgc 
ctcctgccga gaaagtatcc atcatggctg 
cggctacctg cccattcgac caccaagcga 
tggaagccgg tcttgtcgat caggatgatc 
ccgaactgtt cgccaggctc aaggcgcgca 
atggcgatgc ctgcttgccg aatatcatgg 
actgtggccg gctgggtgtg gcggaccgct 
ttgctgaaga gcttggcggc gaatgggctg 
ctcccgattc gcagcgcatc gccttctatc 
cagaccacaa cggtttccct ctagcgggat 
aacgttactg gccgaagccg cttggaataa 
tccaccatat tgccgtcttt tggcaatgtg 
acgagcattc ctaggggtct ttcccctctc 
gtgaaggaag cagttcctct ggaagcttct 
tgcaggcagc ggaacccccc acctggcgac 
taagatacac ctgcaaaggc ggcacaaccc 
gaaagagtca aatggctctc ctcaagcgta 
gtaccccatt gtatgggatc tgatctgggg 
tcgaggttaa aaaacgtcta ggccccccga 
cacgataata ccatggcgcc tattacggcc 
tgcatcatca ctagcctcac aggccgggac 
gtctccaccg caacacaatc tttcctggcg 
tatcatggtg ccggctcaaa gacccttgcc 
accaatgtgg accaggacct cgtcggctgg 
ccatgcacct gcggcagctc ggacctttac 
gtgcgccggc ggggcgacag cagggggagc 
aagggctctt cgggcggtcc actgctctgc 
gctgccgtgt gcacccgagg ggttgcgaag^ 
gaaaccacta tgcggtcccc ggtcttcacg 
acattccagg tggcccatct acacgcccct 
gctgcgtatg cagcccaagg gtataaggtg 
ctaggtttcg gggcgtatat gtctaaggca 
gtagggacca tcaccacggg tgcccccatc 
gacggtggtt gctctggggg cgcctatgac 
gactcgacca ctatcctggg catcggcaca 
cgactcgtcg tgctcgccac cgctacgcct 
atcgaggagg tggctctgtc cagcactgga 
atcgcgacca tcaagggggg gaggcacctc 
gagctcgccg cgaagctatc cggcctcgga 
gatgtatccg tcataccaac tagcggagac 
acgggcttta ccggcgattt cgactcagtg 
gtcgacttca gcctggaccc gaccttcacc 
gtgtcacgct cgcagcggcg aggcaggact 
gtgactccag gagaacggcc ctcgggcatg 
gacgcgggct gtgcttggta cgagctcacg 
tacctaaaca caccagggct gcccgtctgc 
tttacaggcc tcacccacat agacgcccat 
aacttcccct acctggtagc ataccaggct 
ccatcgtggg accaaatgtg ggagtgtctc 



catagatcac tcccctgtga ggaactactg 60 
ttagtatgag tgtcgtgcag cctccaggac 120 
cggaaccggt gagtacaccg gaattgccag 180 
ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgaaaggcc ttgtggtact gcctgatagg 300 
gaccgtgcac catgagcacg aatcctaaac 360 
ggcgcgccat gattgaacaa gatggattgc 420 
ggctattcgg ctatgactgg gcacaacaga 480 
ggctgtcagc gcaggggcgc ccggttcttt 540 
atgaactgca ggacgaggca gcgcggctat 600 
cagctgtgct cgacgttgtc actgaagcgg 660 
cggggcagga tctcctgtca tctcaccttg 720 
atgcaatgcg gcggctgcat acgcttgatc 780 
aacatcgcat cgagcgagca cgtactcgga 840 
tggacgaaga gcatcagggg ctcgcgccag 900 
tgcccgacgg cgaggatctc gtcgtgaccc 960 
tggaaaatgg ccgcttttct ggattcatcg 1020 
atcaggacat agcgttggct acccgtgata 1080 
accgcttcct cgtgctttac ggtatcgccg 1140 
gccttcttga cgagttcttc tgagtttaaa 1200 
caattccgcc cctctccctc ccccccccct 1260 
ggccggtgtg cgtttgtcta tatgttattt 1320 
agggcccgga aacctggccc tgtcttcttg 1380 
gccaaaggaa tgcaaggtct gttgaatgtc 1440 
tgaagacaaa caacgtctgt agcgaccctt 1500 
aggtgcctct gcggccaaaa gccacgtgta 1560 
cagtgccacg ttgtgagttg gatagttgtg 1620 
ttcaacaagg ggctgaagga tgcccagaag 1680 
cctcggtgca catgctttac atgtgtttag 1740 
accacgggga cgtggttttc ctttgaaaaa 1800 
tactcccaac agacgcgagg cctacttggc 1860 
aggaaccagg tcgaggggga ggtccaagtg 1920 
acctgcgtca atggcgtgtg ttggactgtc 1980 
ggcccaaagg gcccaatcac ccaaatgtac 2040 
caagcgcccc ccggggcgcg ttccttgaca 2100 
ttggtcacga ggcatgccga tgtcattccg 2160 
ctactctccc ccaggcccgt ctcctacttg 2220 
ccctcggggc atgctgtggg catctttcgg 2280 
gcggtggact ttgtacccgt cgagtctatg 2340 
gacaactcgt cccctccggc cgtaccgcag 2400 
actggtagcg gcaagagcac taaggtgccg 2460 
cttgtcctga acccgtccgt cgccgccacc 2520 
catggtatcg accctaacat cagaaccggg 2580 
acgtactcca cctatggcaa gtttcttgcc 2640 
atcataatat gtgatgagtg ccactcaact 2700 
gtcctggacc aagcggagac ggctggagcg 27 60 
ccgggatcgg tcaccgtgcc acatccaaac 2820 
gaaatcccct tttatggcaa agccatcccc 2880 
attttctgcc attccaagaa gaaatgtgat 2940 
ctcaatgctg tagcatatta ccggggcctt 3000 
gtcattgtcg tagcaacgga cgctctaatg 3060 
atcgactgca atacatgtgt cacccagaca 3120 
attgagacga cgaccgtgcc acaagacgcg 3180 
ggtaggggca ggatgggcat ttacaggttt 3240 
ttcgattcct cggttctgtg cgagtgctat 3300 
cccgccgaga cctcagttag gttgcgggct 3360 
caggaccatc tggagttctg ggagagcgtc 3420 
ttcttgtccG agactaggca ggcaggagac 3480 
acggtgtgcg ccagggctca ggctccacct 354 0 
atacggctaa agcctacgct gcacgggcca 3600 
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acgcccctgc 
accaaataca 
ctggtaggcg 
attgtgggca 
taccgggagt 
atgcagctcg 
caagcggagg 
gcgaagcata 
cctggcaacc 
accacccaac 
cctcccagcg 
ataggccttg 
gcgctcgtgg 
ctactccctg 
ctgcgtcggc 
ttcgcttcgc 
gcacgtgtca 
cagtggatca 
gattggatat 
cgattgccgg 
gacggcatca 
ggttccatga 
attaacgcgt 
ctgtggcggg 
gtgacgggca 
ttcacagaag 
cgggaggagg 
tgcgggcccg 
acggcggaga 
tcagctagcc 
ccggacgctg 
acccgcgtgg 
gaggaggatg 
cct cgagcga 
aaggacccgg 
cctccgatac 
tctgccttgg 
agcggcacgg 
gttgagtcgt 
gacgggtctt 
tcctacacat 
gtcaatgcac 
cgcagcgcaa 
cactaccggg 
ctatccgtgg 
tatggggcaa 
tggaaggact 
gaggt tttct 
ccagatttgg 
cctcaggccg 
ttcctggtga 
tgttttgact 
tgtgacttgg 
gggggccccc 
ggtgtactga 
tgtcgagctg 
atctgtgaaa 
atgactagat 
ataacatcat 
tatctcaccc 
actccagtca 
atgatcctga 
ctagattgtc 
attcaacgac 
aatagggtgg 
cgggccagaa 
aagtacctct 



tgtataggct 
tcatggcatg 
gagtcctagc 
ggatcatctt 
tcgatgagat 
ccgaacaatt 
ctgctgctcc 
tgtggaat tt 
ccgcgat age 
ataccct cct 
ctgcttctgc 
ggaaggtgct 
cctttaaggt 
ctatcctctc 
acgtgggccc 
ggggtaacca 
ctcagatcct 
acgaggactg 
gcacggtgtt 
gagtcccct t 
tgcaaaccac 
ggatcgtggg 
acaccacggg 
tggctgctga 
tgaccactga 
tggatggggt 
tcacattcct 
aaccggacgt 
cggctaagcg 
agctgtctgc 
acctcat cga 
agtcagaaaa 
agagggaagt 
tgcccatatg 
actacgt ccc 
cacctccacg 
cggagctcgc 
caacggcctc 
actcctccat 
ggtctaccgt 
ggacaggcgc 
tgagcaactc 
gcctgcggca 
acgt gctcaa 
aggaagcctg 
aggacgt ccg 
tgctggaaga 
gcgt ccaacc 
gggttcgtgt 
tgatgggctc 
atgcctggaa 
caacggtcac 
cccccgaagc 
tgactaattc 
cgaccagctg 
cgaagctcca 
gcgcggggac 
actctgcccc 
gctcctccaa 
gtgaccccac 
attcctggct 
tgactcattt 
agat ctacgg 
tccatggcct 
cttcatgcct 
gtgtccgcgc 
tcaactgggc 



gggagccgtt 
catgtcggct 
agctctggcc 
gt ccggaaag 
ggaagagtgc 
caaacagaag 
cgtggtggaa 
cat cagcggg 
atcactgatg 
gtttaacatc 
tttcgtaggc 
tgtggatatt 
catgagcggc 
ccctggcgcc 
aggggagggg 
cgtctccccc 
ctctagtctt 
ct ccacgcca 
gactgatttc 
ctt ctcatgt 
ctgcccatgt 
gcctaggacc 
cccctgcacg 
ggagtacgtg 
caacgtaaag 
gcggttgcac 
ggtcgggctc 
agcagtgctc 
taggctggcc 
gccttccttg 
ggccaacctc 
taaggtagta 
atccgttccg 
ggcacgcccg 
tccagtggta 
gagggagagg 
cacaaagacc 
tcctgaccag 
gccccccctt 
aagcgaggag 
cctgatcacg 
tttgctccgt 
gaagaaggtc 
ggagatgaag 
taagctgacg 
gaacctatcc 
cactgagaca 
agagaagggg 
gtgcgagaaa 
ttcatacgga 
agcgaagaaa 
tgagaatgac 
cagacaggcc 
taaagggcag 
cggtaatacc 
ggactgcacg 
ccaagaggac 
ccctggggac 
tgtgtcagtc 
cacccccctt 
aggcaacatc 
cttctccatc 
ggcctgttac 
tagcgcattt 
caggaaactt 
taggctactg 
agtaaggacc 



caaaacgagg 
gacctggagg 
gcgtattgcc 
ccggccatca 
gcctcacacc 
gcaatcgggt 
tccaagtggc 
atacaatat t 
gcattcacag 
ctggggggat 
gccggcatcg 
ttggcaggtt 
gagatgccct 
ctagtcgtcg 
gctgtgcagt 
acgcactatg 
accatcactc 
tgct ccggct 
aagacct ggc 
caacgtgggt 
ggagcacaga 
tgtagtaaca 
ccctccccgg 
gaggttacgc 
tgcccgtgtc 
aggtacgctc 
aatcaatacc 
acttccatgc 
aggggatctc 
aaggcaacat 
ctgtggcggc 
attttggact 
gcggagatcc 
gattacaacc 
cacgggtgt c 
acggttgtcc 
ttcggcagct 
ccctccgacg 
gagggggagc 
gctagtgagg 
ccatgcgctg 
caccacaact 
acctttgaca 
gcgaaggcgt 
cccccacat t 
agcaaggccg 
ccaattgaca 
ggccgcaagc 
atggcccttt 
ttccaatact 
tgccctatgg 
atccgtgttg 
ataaggtcgc 
aactgcggct 
ct cacatgt t 
atgctcgtat 
gaggcgagcc 
ccgcccaaac 
gcgcacgatg 
gcgcgggctg 
atcatgtatg 
cttctagctc 
t ccattgagc 
tcactccata 
ggggtaccgc 
tcccaggggg 
aagctcaaac 



ttactaccac 
tcgtcacgag 
tgacaacagg 
ttcccgacag 

tcccttacat 
tgctgcaaac 
ggaccct cga 
tagcaggct t 
cctctat cac 
gggtggccgc 
ctggagcggc 
atggagcagg 
ccaccgagga 
gggtcgtgtg 
ggatgaaccg 
tgtctgagag 
agctgctgaa 
cgtggctaag 
tccagtccaa 
acaagggagt 
tcaccggaca 
cgtggcatgg 
cgccaaatt a 
gggtggggga 
aggttccggc 
cagcgtgcaa 
tggttgggtc 
tcaccgaccc 
ccccctcctt 
gcactacccg 
aggagatggg 
ctttcgagcc 
tgcggaggt c 
ctccactgtt 
cattgccgcc 
tgtcagaatc 
ccgaatcgt c 
acggcgacgc 
cgggggatcc 
acgtcgtctg 
cggaggaaac 
tggtctatgc 
gactgcaggt 
ccacagttaa 
cggccagat c 
ttaaccacat 
ccaccatcat 
cagctcgcct 
acgatgtggt 
ctcctggaca 
gcttcgcata 
aggagtcaat 
tcacagagcg 
atcgccggtg 
atttgaaggc 
gcggagacga 
t acgggcctt 
cagaatacga 
catctggcaa 
cgtgggagac 
cgcccacctt 
aggaacaact 
cacttgacct 
gttactctcc 
ccttgcgagt 
ggagggctgc 
tcactccaat 



acaccccata 
cacctgggtg 
cagcgtggtc 
ggaagtcctt 
cgaacaggga 
agccaccaag 
agccttctgg 
gtccactctg 
cagcccgct c 
ccaacttgct 
tgttggcagc 
ggtggcaggc 
cctggttaac 
cgcagcgata 
gctgatagcg 
cgacgctgca 
gaggcttcac 
agatgtttgg 
gctcctgccg 
ctggcggggc 
tgtgaaaaac 
aacat tcccc 
tt ctagggcg 
ttt ccact ac 
ccccgaattc 
acccctccta 
acagctccca 
ctcccacatt 
ggccagctca 
tcatgactcc 
cgggaacatc 
gctccaagcg 
caggaaatt c 
agagtcctgg 
tgccaaggcc 
taccgtgtct 
ggccgtcgac 
gggat ccgac 
cgatctcagc 
ctgctcgatg 
caagctgccc 
tacaacatct 
cctggacgac 
ggctaaactt 
taaatttggc 
ccgctccgtg 
ggcaaaaaat 
tatcgtattc 
ctccaccctc 
gcgggtcgag 
tgacacccgc 
ctaccaatgt 
gctttacatc 
ccgcgcgagc 
cgctgcggcc 
ccttgtcgtt 
cacggaggct 
cttggagttg 
aagggtgtac 
agctagacac 
gtgggcaagg 
tgaaaaagcc 
acctcagatc 
aggtgagatc 
ctggagacat 
cacttgtggc 
cccggctgcg 



3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
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tcccagttgg atttatccag 
agcctgtctc gtgcccgacc 
gtaggcatct atctactccc 
ccatcctgtt tttttccctt 

ttttctcctt tttttttcct 
cctagtcacg gctagctgtg 
tggcctctct gcagatcaag 



ctggttcgtt gctggttaca 
ccgctggttc atgtggtgcc 
caaccgatga acggggagct 
tttttttttc tttttttttt 

ctttttttcc ttttctttcc 
aaaggtccgt gagccgcttg 
t 



gcgggggaga catatatcac 7680 
tactcctact ttctgtaggg 7740 
aaacactcca ggccaatagg 7800 
tttttttttt tttttttttt 7860 

tttggtggct ccatcttagc 7920 
actgcagaga gtgctgatac 7980 

8001 
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<210> 7 
<211> 11076 
<212> DNA 

<213> Hepatitis C Virus 



<400> 7 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 480 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900 
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440 
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560 
taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 1620 
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatgggcac gaatcctaaa cctcaaagaa aaaccaaacg taacaccaac 1860 
cgccgcccac aggacgtcaa gttcccgggc ggtggtcaga tcgtcggtgg agtttacctg 1920 
ttgccgcgca ggggccccag gttgggtgtg cgcgcgacta ggaagacttc cgagcggtcg 1980 
caacctcgtg gaaggcgaca acctatcccc aaggctcgcc agcccgaggg tagggcctgg 2040 
gctcagcccg ggtacccctg gcccctctat ggcaatgagg gcttggggtg ggcaggatgg 2100 
ctcctgtcac cccgtggctc tcggcctagt tggggcccca cggacccccg gcgtaggtcg 2160 
cgcaatttgg gtaaggtcat cgataccctc acgtgcggct tcgccgatct catggggtac 2220 
attccgctcg tcggcgcccc cctagggggc gctgccaggg ccctggcgca tggcgtccgg 2280 
gttctggagg acggcgtgaa ctatgcaaca gggaatctgc ccggttgctc cttttctatc 2340 
ttccttttgg ctttgctgtc ctgtttgacc atcccagctt ccgcttatga agtgcgcaac 2400 
gtatccggag tgtaccatgt cacgaacgac tgctccaacg caagcattgt gtatgaggca 24 60 
gcggacatga tcatgcatac ccccgggtgc gtgccctgcg ttcgggagaa caactcctcc 2520 
cgctgctggg tagcgctcac tcccacgctc gcggccagga acgctagcgt ccccactacg 2580 
acgatacgac gccatgtcga tttgctcgtt ggggcggctg ctctctgctc cgctatgtac 2640 
gtgggagatc tctgcggatc tgttttcctc gtcgcccagc tgttcacctt ctcgcctcgc 2700 
cggcacgaga cagtacagga ctgcaattgc tcaatatatc ccggccacgt gacaggtcac 2760 
cgtatggctt gggatatgat gatgaactgg tcacctacag cagccctagt ggtatcgcag 2820 
ttactccgga tcccacaagc tgtcgtggat atggtggcgg gggcccattg gggagtccta 2880 
gcgggccttg cctactattc catggtgggg aactgggcta aggttctgat tgtgatgcta 2940 
ctctttgccg gcgttgacgg gggaacctat gtgacagggg ggacgatggc caaaaacacc 3000 
ctcgggatta cgtccctctt ttcacccggg tcatcccaga aaatccagct tgtaaacacc 3060 
aacggcagct ggcacatcaa caggactgcc ctgaactgca atgactccct caacactggg 3120 
ttccttgctg cgctgttcta cgtgcacaag ttcaactcat ctggatgccc agagcgcatg 3180 
gccagctgca gccccatcga cgcgttcgct caggggtggg ggcccatcac ttacaatgag 3240 
tcacacagct cggaccagag gccttattgt tggcactacg caccccggcc gtgcggtatc 3300 
gtacccgcgg cgcaggtgtg tggtccagtg tactgcttca ccccaagccc tgtcgtggtg 3360 
gggacgaccg accggttcgg cgtccctacg tacagttggg gggagaatga gacggacgtg 3420 
ctgcttctta acaacacgcg gccgccgcaa ggcaactggt ttggctgtac atggatgaat 3480 
agcactgggt tcaccaagac gtgcgggggc cccccgtgta acatcggggg gatcggcaat 3540 
aaaaccttga cctgccccac ggactgcttc cggaagcacc ccgaggccac ttacaccaag 3600 
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tgtggttcgg 
cactacccct 
gagcacaggc 
agggacagat 
tgttccttca 
gtggacgtac 
gagtatgtcc 
atgatgctgc 
gcatccgtgg 
tacat caagg 
ctcctgctcc 
tcgtgcggag 
ct gtt cctcg 
tt gcaagtgt 
acgtgcgcga 
ggt ccactca 
gggctcattc 
gctctcatga 
cgggactggg 
tctgatatgg 
atcttgggcc 
agcct tgaag 
cgaggcctac 
ggggaggtcc 
gtgtgttgga 
at cacccaaa 
gcgcgttcct 
gccgatgt ca 
cccgtctcct 
gtgggcatct 
cccgtcgagt 
ccggccgtac 
agcactaagg 
tccgt cgccg 
aacat cagaa 
ggcaagttt c 
gagtgccact 
gagacggctg 
gt gccacat c 
ggcaaagcca 
aagaagaaat 
tattaccggg 
acggacgctc 
tgtgtcaccc 
gtgccacaag 
ggcatttaca 
ctgtgcgagt 
gttaggt tgc 
ttctgggaga 
aggcaggcag 
gctcaggctc 
acgctgcacg 
accacacacc 
acgagcacct 
acaggcagcg 
gacagggaag 
tacatcgaac 
caaacagcca 
ctcgaagcct 
ggcttgt cca 
atcaccagcc 
gccgcccaac 
gcggctgttg 
gcaggggtgg 
gaggacctgg 
gtgtgcgcag 
aaccggctga 



ggccttggtt 
gcactgtcaa 
tcgaagccgc 
cagagcttag 
ccaccctacc 
aatacctgta 
tgttgctctt 
tgatagctca 
ccggggcgca 
gcaggctggt 
tgctggcgtt 
gcgcggtttt 
ctaggct cat 
ggatcccccc 
t ccacccaga 
tggtgctcca 
gtgcatgcat 
agttggccgc 
cccacgcggg 
agaccaaggt 
t gcccgt etc 
ggcaggggtg 
t tggctgcat 
aagtggtctc 
ctgtctatca 
tgtacaccaa 
tgacaccatg 
ttccggtgcg 
acttgaaggg 
ttcgggctgc 
ctatggaaac 
cgcagacatt 
tgccggctgc 
ccaccctagg 
ccggggtagg 
ttgccgacgg 
caactgact c 
gagcgcgact 
caaacatcga 
tccccatcgc 
gtgatgagct 
gccttgatgt 
taatgacggg 
agacagtcga 
acgcggtgtc 
ggtttgtgac 
gctatgacgc 
gggctt acct 
gcgt ctttac 
gagacaactt 
cacctccatc 
ggccaacgcc 
ccataaccaa 
gggtgctggt 
tggtcattgt 
tcctttaccg 
agggaatgca 
ccaagcaagc 
tctgggcgaa 
ctctgcctgg 
cgctcaccac 
ttgctcctcc 
gcagcatagg 
caggcgcgct 
ttaacctact 
cgatactgcg 
tagcgttcgc 



gacacccaga 
ctttaccatc 
atgcaattgg 
cccgctgctg 
ggctctgtcc 
cggtataggg 
ccttcttctg 
agctgaggcc 
tggcat tctc 
ccctggggcg 
accaccacga 
cgtaggtctg 
atggtggtta 
cctcaacgtt 
gctaat cttt 
ggctggtata 
gctggtgcgg 
actgacaggt 
cctacgagac 
tatcacctgg 
cgcccgcagg 
gcgactcctc 
catcactagc 
caccgcaaca 
tggtgccggc 
tgtggaccag 
cacctgcggc 
ccggcggggc 
ctcttcgggc 
cgtgtgcacc 
cactatgcgg 
ccaggtggcc 
gtatgcagcc 
tttcggggcg 
gaccatcacc 
tggttgctct 
gaccactat c 
cgtcgtgctc 
ggaggtggct 
gaccatcaag 
cgccgcgaag 
atccgtcata 
ctttaccggc 
cttcagcctg 
acgctcgcag 
tccaggagaa 
gggctgtgct^ 
aaacacacca 
aggcctcacc 
cccctacctg 
gtgggaccaa 
cctgctgt at 
atacatcatg 
aggcggagtc 
gggcaggatc 
ggagttcgat 
gctcgccgaa 
ggaggctgct 
gcatatgtgg 
caaccccgcg 
ccaacatacc 
cagcgctgct 
ccttgggaag 
cgtggccttt 
ccctgctatc 
tcggcacgtg 
ttcgcggggt 



tgcttggtcc 
t tcaaggtta 
actcgaggag 
ctgtctacaa 
actggtttga 
tcggcggttg 
gcggacgcgc 
gccctagaga 
t ccttcctcg 
gcatatgccc 
gcatacgcca 
atact ctt ga 
caatatttta 
cgggggggcc 
accatcacca 
accaaagtgc 
aaggttgctg 
acgtacgttt 
cttgcggtgg 
ggggcagaca 
gggagggaga 
gcgcctatta 
ctcacaggcc 
caatctt t cc 
tcaaagaccc 
gacct cgtcg 
agctcggacc 
gacagcaggg 
ggtccactgc 
cgaggggttg 
tccccggtct 
catctacacg 
caagggtata 
tatatgtcta 
acgggtgccc 
gggggcgcct 
ctgggcatcg 
gccaccgcta 
ctgtccagca 
ggggggaggc 
ctatccggcc 
ccaactagcg 
gatttcgact 
gacccgacct 
cggcgaggca 
cggccctcgg 
' tggtacgagc 
gggctgcccg 
cacatagacg 
gtagcatacc 
atgtgggagt 
aggctgggag 
gcatgcatgt 
ctagcagctc 
atcttgtccg 
gagatggaag 
caattcaaac 
gctcccgtgg 
aatttcatca 
atagcatcac 
ctcctgttta 
tctgctttcg 
gtgcttgtgg 
aaggtcatga 
ctctcccctg 
ggcccagggg 
aaccacgtct 



actacccata 
ggatgtacgt 
agcgttgtaa 
cggagtggca 
tccatctcca 
tctcctttgc 
gcgtctgtgc 
acctggtggt 
tgttcttctg 
tctacggcgt 
tggaccggga 
ccttgt cacc 
tcaccagggc 
gcgatgccgt 
aaatcttgct 
cgtacttcgt 
ggggtcatta 
atgaccatct 
cagttgagcc 
ccgcggcgtg 
tacatctggg 
cggcctactc 
gggacaggaa 
tggcgacctg 
ttgccggccc 
gctggcaagc 
tttacttggt 
ggagcctact 
tctgcccctc 
cgaaggcggt 
tcacggacaa 
cccctactgg 
aggtgcttgt 
aggcacatgg 
ccatcacgta 
atgacatcat 
gcacagtcct 
cgcctccggg 
ctggagaaat 
acct catt tt 
tcggactcaa 
gagacgtcat 
cagtgatcga 
tcaccattga 
ggactggtag 
gcatgttcga 
tcacgcccgc 
tctgccagga 
cccatttctt 
aggctacggt 
gtctcatacg 
ccgttcaaaa 
cggctgacct 
tggccgcgta 
gaaagccggc 
agtgcgcctc 
agaaggcaat 
tggaatccaa 
gcgggataca 
tgatggcatt 
acatcctggg 
taggcgccgg 
atattttggc 
gcggcgagat 
gcgccctagt 
agggqqctgt 
cccccacgca 



caggctttgg 
ggggggagtg 
cctggaggac 
ggtattgccc 
tcagaacgtc 
aatcaaatgg 
ctgcttgtgg 
cctcaacgcg 
tgctgcctgg 
atggccgcta 
gatggcagca 
gcactataag 
cgaggcacac 
cat cctcctc 
cgccatact c 
gcgcgcacac 
tgtccaaatg 
caccccactg 
cgtcgtcttc 
tggggacatc 
accggcagac 
ccaacagacg 
ccaggtcgag 
cgtcaat ggc 
aaagggccca 
gccccccggg 
cacgaggcat 
ctcccccagg 
ggggcatgct 
ggactttgta 
ctcgtcccct 
tagcggcaag 
cctgaacccg 
tatcgaccct 
ctccacctat 
aatatgtgat 
ggaccaagcg 
at cggtcacc 
ccccttttat 
ctgccattcc 
tgctgtagca 
tgtcgtagca 
ctgcaataca 
gacgacgacc 
gggcaggatg 
ttcctcggtt 
cgagacctca 
ccatctggag 
gtcccagact 
gtgcgccagg 
gctaaagcct 
cgaggttact 
ggaggtcgtc 
ttgcctgaca 
catcattccc 
acacctccct 
cgggttgctg 
gtggcggacc 
atatttagca 
cacagcctct 
gggatgggtg 
catcgctgga 
aggttatgga 
gccctccacc 
cgtcggggtc 
gcagtggatg 
ctatgtgtct 



3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
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gagagcgacg ctgcagcacg tgtcactcag atcctctcta gtcttaccat cactcagctg 7680 

ctgaagaggc ttcaccagtg gatcaacgag gactgctcca cgccatgctc cggctcgtgg 7740 

ctaagagatg tttgggattg gatatgcacg gtgttgactg atttcaagac ctggctccag 7800 

tccaagctcc tgccgcgatt gccgggagtc cccttcttct catgtcaacg tgggtacaag 7860 

ggagtctggc ggggcgacgg catcatgcaa accacctgcc catgtggagc acagatcacc 7920 

ggacatgtga aaaacggttc catgaggatc gtggggccta ggacctgtag taacacgtgg 7980 

catggaacat tccccattaa cgcgtacacc acgggcccct gcacgccctc cccggcgcca 8040 

aattattcta gggcgctgtg gcgggtggct gctgaggagt acgtggaggt tacgcgggtg 8100 

ggggatttcc actacgtgac gggcatgacc actgacaacg taaagtgccc gtgtcaggtt 8160 

ccggcccccg aattcttcac agaagtggat ggggtgcggt tgcacaggta cgctccagcg 8220 

tgcaaacccc tcctacggga ggaggtcaca ttcctggtcg ggctcaatca atacctggtt 8280 

gggtcacagc tcccatgcgg gcccgaaccg gacgtagcag tgctcacttc catgctcacc 8340 ^ 

gacccctccc acattacggc ggagacggct aagcgtaggc tggccagggg atctcccccc 8400 

tccttggcca gctcatcagc tagccagctg tctgcgcctt ccttgaaggc aacatgcact 8460 

acccgtcatg actccccgga cgctgacctc atcgaggcca acctcctgtg gcggcaggag 8520 

atgggcggga acatcacccg cgtggagtca gaaaataagg tagtaatttt ggactctttc 8580 

gagccgctcc aagcggagga ggatgagagg gaagtatccg ttccggcgga gatcctgcgg 8640 

aggtccagga aattccctcg agcgatgccc atatgggcac gcccggatta caaccctcca 8700 

ctgttagagt cctggaagga cccggactac gtccctccag tggtacacgg gtgtccattg 8760 

ccgcctgcca aggcccctcc gataccacct ccacggaggg agaggacggt tgtcctgtca 8820 

gaatctaccg tgtcttctgc cttggcggag ctcgccacaa agaccttcgg cagctccgaa 8880 

tcgtcggccg tcgacagcgg cacggcaacg gcctctcctg accagccctc cgacgacggc 8940 

gacgcgggat ccgacgttga gtcgtactcc tccatgcccc cccttgaggg ggagccgggg 9000 

gatcccgatc tcagcgacgg gtcttggtct accgtaagcg aggaggctag tgaggacgtc 9060 

gtctgctgct cgatgtccta cacatggaca ggcgccctga tcacgccatg cgctgcggag 9120 

gaaaccaagc tgcccgtcaa tgcactgagc aactctttgc tccgtcacca caacttggtc 9180 

tatgctacaa catctcgcag cgcaagcctg cggcagaaga aggtcacctt tgacagactg 9240 

caggtcctgg acgaccacta ccgggacgtg ctcaaggaga tgaaggcgaa ggcgtccaca 9300 

gttaaggcta aacttctatc cgtggaggaa gcctgtaagc tgacgccccc acattcggcc 9360 

agatctaaat ttggctatgg ggcaaaggac gtccggaacc tatccagcaa ggccgttaac 9420 

cacatccgct ccgtgtggaa ggacttgctg gaagacactg agacaccaat tgacaccacc 9480 

atcatggcaa aaaatgaggt tttctgcgtc caaccagaga aggggggccg caagccagct 9540 

cgccttatcg tattcccaga tttgggggtt cgtgtgtgcg agaaaatggc cctttacgat 9600 

gtggtctcca ccctccctca ggccgtgatg ggctcttcat acggattcca atactctcct 9660 

ggacagcggg tcgagttcct ggtgaatgcc tggaaagcga agaaatgccc tatgggcttc 9720 

gcatatgaca cccgctgttt tgactcaacg gtcactgaga atgacatccg tgttgaggag 9780 

tcaatctacc aatgttgtga cttggccccc gaagccagac aggccataag gtcgctcaca 9840 

gagcggcttt acatcggggg ccccctgact aattctaaag ggcagaactg cggctatcgc 9900 

cggtgccgcg cgagcggtgt actgacgacc agctgcggta ataccctcac atgttatttg 9960 

aaggccgctg cggcctgtcg agctgcgaag ctccaggact gcacgatgct cgtatgcgga 10020 

gacgaccttg tcgttatctg tgaaagcgcg gggacccaag aggacgaggc gagcctacgg 10080 

gccttcacgg aggctatgac tagatactct gccccccctg gggacccgcc caaaccagaa 10140 

tacgacttgg agttgataac atcatgctcc tccaatgtgt cagtcgcgca cgatgcatct 10200 

ggcaaaaggg tgtactatct cacccgtgac cccaccaccc cccttgcgcg ggctgcgtgg 10260 

gagacagcta gacacactcc agtcaattcc tggctaggca acatcatcat gtatgcgccc 10320 

accttgtggg caaggatgat cctgatgact catttcttct ccatccttct agctcaggaa 10380 

caacttgaaa aagccctaga ttgtcagatc" tacggggcct gttactccat tgagccactt 10440 

gacctacctc agatcattca acgactccat ggccttagcg cattttcact ccatagttac 10500 

tctccaggtg agatcaatag ggtggcttca tgcctcagga aacttggggt accgcccttg 10560 

cgagtctgga gacatcgggc cagaagtgtc cgcgctaggc tactgtccca gggggggagg 10620 

gctgccactt gtggcaagta cctcttcaac tgggcagtaa ggaccaagct caaactcact 10680 

ccaatcccgg ctgcgtccca gttggattta tccagctggt tcgttgctgg ttacagcggg 10740 

ggagacatat atcacagcct gtctcgtgcc cgaccccgct ggttcatgtg gtgcctactc 10800 

ctactttctg taggggtagg catctatcta ctccccaacc gatgaacggg gagctaaaca 10860 

ctccaggcca ataggccatc ctgttttttt cccttttttt ttttcttttt tttttttttt 10920 

tttttttttt tttttttttc tccttttttt ttcctctttt tttccttttc tttcctttgg 10980 

tggctccatc ttagccctag tcacggctag ctgtgaaagg tccgtgagcc gcttgactgc 11040 

agagagtgct gatactggcc tctctgcaga tcaagt 11076 
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<210> 8 
<211> 8001 
<212> DNA 

<213> Hepatitis C Virus 



<400> 8 

gccagccccc 

tcttcacgca 

cccccctccc 

gacgaccggg 

gcgagactgc 

gtgcttgcga 

ctcaaagaaa 

acgcaggttc 

caatcggctg 

ttgtcaagac 

cgtggctggc 

gaagggactg 

ctcctgccga 

cggctacctg 

tggaagccgg 

ccgaactgtt 

atggcgatgc 

actgtggccg 

ttgctgaaga 

ctcccgattc 

cagaccacaa 

aacgttactg 

tccaccatat 

acgagcatt c 

gtgaaggaag 

tgcaggcagc 

taagatacac 

gaaagagtca 

gtaccccatt 

tcgaggttaa 

cacgataata 

tgcatcatca 

gtctccaccg 

tatcatggtg 

accaatgtgg 

ccatgcacct 

gtgcgccggc 

aagggctctt 

gctgccgtgt 

gaaaccacta 

acattccagg 

gctgcgtatg 

ctaggtttcg 

gtaaggacca 

gacggtggtt 

gactcgacca 

cgactcgtcg 

atcgaggagg 

atcgagacca 

gagctcgccg 

gatgtatccg 

acgggcttta 

gtcgacttca 

gtgtcacgct 

gtgactccag 

gacgcgggct 

tacctaaaca 

tttacaggcc 

aacttcccct 

ccatcgtggg 



gattgggggc 
gaaagcgtct 
gggagagcca 
tcctttcttg 
tagccgagta 
gtgccccggg 
aaccaaacgt 
tccggccgct 
ctctgatgcc 
cgacctgtcc 
cacgacgggc 
gctgctattg 
gaaagtatcc 
cccattcgac 
tcttgtcgat 
cgccaggctc 
ctgcttgccg 
gctgggtgtg 
gcttggcggc 
gcagcgcatc 
cggtttccct 
gccgaagccg 
tgccgtcttt 
ctaggggtct 
cagttcctct 
ggaacccccc 
ctgcaaaggc 
aatggct etc 
gtatgggatc 
aaaacgtcta 
ccatggcgcc 
ctagcctcac 
caacacaatc 
ccggctcaaa 
accaggacct 
gcggcagctc 
ggggcgacag 
cgggcggt cc 
gcacccgagg 
tgcggtcccc 
tggcccatct 
cagcccaagg 
gggcgtatat 
tcaccacggg 
gctctggggg 
ctatcctggg 
tgct cgccac 
tggctctgtc 
tcaagggggg 
cgaagctgtc 
tcataccaac 
ccggtgactt 
gcctggaccc 
cgcagcggcg 
gagaacggcc 
gtgcttggta 
caccagggtt 
tcacccacat 
acctggtagc 
accaaatgtg 



gacactccac 
agccatggcg 
tagtggtctg 
gatcaacccg 
gtgttgggtc 
aggtctcgta 
aacaccaacg 
tgggtggaga 
gccgt gtt cc 
ggtgccctga 
gttccttgcg 
ggcgaagtgc 
atcat ggctg 
caccaagcga 
caggatgatc 
aaggcgcgca 
aatatcatgg 
gcggaccgct 
gaatgggctg 
gccttctatc 
ctagcgggat 
cttggaataa 
tggcaatgtg 
ttcccctctc 
ggaagcttct 
acctggcgac 
ggcacaaccc 
ctcaagcgta 
tgatctgggg 
ggccccccga 
tattacggcc 
aggccgggac 
tttcctggcg 
gacccttgcc 
cgtcggctgg 
ggacctttac 
cagggggagc 
actgctctgc 
ggttgcgaag 
ggtcttcacg 
acacgcccct 
gtataaggtg 
gtctaaggca 
tgcccccatc 
cgcctatgac 
catcggcaca 
cgctacgcct 
cagcactgga 
gaggcacctc 
cggcctcgga 
tagcggagac 
cgactcagtg 
gaccttcacc 
aggcaggact 
ctcgggcatg 
cgagctcacg 
gcccgtctgc 
agacgcccat 
ataccaggct 
gaagtgtctc 



catagatcac 
ttagtatgag 
cggaaccggt 
ctcaatgcct 
gcgaaaggcc 
gaccgtgcac 
ggcgcgccat 
ggctattcgg 
ggctgt cage 
atgaactgea 
cagetgtgct 
eggggeagga 
atgcaatgcg 
aacat egeat 
tggacgaaga 
tgcccgacgg 
tggaaaatgg 
atcaggacat 
accgcttcct 
gccttettga 
caattecgce 
ggceggtgtg 
agggecegga 
gecaaaggaa 
tgaagacaaa 
aggtgeetct 
eagtgceacg 
ttcaacaagg 
cctcggtgca 
accacgggga 
tactcccaac 
aggaaccagg 
acctgcgtca 
ggcccaaagg 
caagcgcccc 
ttggtcacga 
ctaeteteec 
ecetcggggc 
gcggtggact 
gacaactcgt 
actggtagcg 
cttgtcctga 
catggtatcg 
acgtactcca 
atcataatat 
gtcctggacc 
ccgggatcgg 
gaaatcccct 
attttctgcc 
ctcaatgctg 
gtcattgtcg 
atcgactgca 
attgagacga 
ggtaggggca 
ttcgattcct 
cccgccgaga 
caggaccatc 
ttcttgtccc 
acggtgtgcg 
atacggctaa 



tcccctgtga 
tgtcgtgcag 
gagtacaccg 
ggagatttgg 
ttgtggtact 
catgagcacg 
gattgaacaa 
ctatgactgg 
gcaggggcgc 
ggacgaggca 
cgacgttgtc 
tctcctgtca 
gcggctgcat 
cgagcgagca 
gcatcagggg 
cgaggatctc 
ccgcttttct 
agcgttggct 
cgtgctttac 
cgagttcttc 
cctctccctc 
cgtttgtcta 
aacctggccc 
tgcaaggtct 
caacgtctgt 
gcggccaaaa 
ttgtgagttg 
ggctgaagga 
catgctttac 
cgtggttttc 
agacgcgagg 
tcgaggggga 
atggcgtgtg 
gcccaatcac 
ccggggcgcg 
ggcatgccga 
ccaggcccgt 
acgctgtggg 
ttgtacccgt 
cccctccggc 
gcaagagcac 
acccgtccgt 
accctaacat 
cctatggcaa 
gtgatgagtg 
aagcggagac 
tcaccgtgcc 
tttatggcaa 
attccaagaa 
tagcatatta 
tagcaacgga 
atacatgtgt 
cgaccgtgcc 
ggatgggcat 
cggttctgtg 
cctcagttag 
tggagttctg 
agactaagca 
ccagggctca 
agcctacgct 



ggaactactg 
act ccaggac 
gaattgccag 
gcgtgccccc 
gcctgatagg 
aatcctaaac 
gatggattgc 
gcacaacaga 
ccggttcttt 
gcgcggctat 
actgaagcgg 
tctcaccttg 
acgcttgat c 
cgtactcgga 
ctcgcgccag 
gtcgtgaccc 
ggattcatcg 
acccgtgata 
ggtatcgccg 
tgagtttaaa 
ccccccccct 
tatgttattt 
tgtcttcttg 
gttgaatgtc 
agcgaccctt 
gccacgtgta 
gatagttgtg 
tgcccagaag 
atgtgtttag 
ctttgaaaaa 
cctacttggc 
ggtccaagtg 
ttggactgtc 
ccaaatgtac 
ttccttgaca 
tgtcattccg 
ctcctacttg 
catctttcgg 
cgagtctatg 
cgtaccgcag 
taaggtgccg 
cgccgccacc 
cagaatcggg 
gtttcttgcc 
ccactcaact 
ggctggagcg 
acatccaaac 
agccatcccc 
gaaatgtgat 
ccggggcctt 
cgctctaatg 
cacccagaca 
acaagacgcg 
ttacaggttt 
cgagtgctat 
gttgcgggct 
ggagagcgtc 
ggcaggagac 
ggctccacct 
gcacgggcca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 



CA 02303526 2000-03-31 



acgcccctgc 
accaaataca 
ctggtaggcg 
attgtgggca 
taccgggagt 
atgcagct eg 
caagcggagg 
gcgaagcat a 
cctggcaacc 
accacccaac 
cctcccagcg 
ataggccttg 
gcgctcgtgg 
ctactccctg 
ctgcgtcggc 
ttcgcttcgc 
gcacgtgtca 
cagtggat ca 
gattggat at 
cgattgccgg 
gacggcat ca 
ggttccat ga 
attaacgcgt 
ctgtggcggg 
gtgacgggca 
ttcacagaag 
cgggaggagg 
tgcgagcccg 
acggcggaga 
tcagctagcc 
ccggacgctg 
acccgcgtgg 
gaggaggatg 
cctcgagcga 
aaggacccgg 
cctccgatac 
tctgccttgg 
agcggcacgg 
gttgagtcgt 
gacgggtctt 
tcctacacat 
at caatgcac 
cgcagcgcaa 
cactaccggg 
ctatccgtgg 
tatggggcaa 
tggaaggact 
gaggttttct 
ccagatttgg 
cctcaggccg 
ttcctggtga 
tgttttgact 
tgtgacttgg 
gggggccccc 
ggtgtactga 
tgtcgagctg 
atctgtgaaa 
atgactagat 
ataacatcat 
tatctcaccc 
actccagtca 
atgatcctga 
ctagattgtc 
attcaacgac 
aatagggtgg 
cgggccagaa 
aagtacctct 



tgtataggct 
tcatggcatg 
gagtcctagc 
ggatcatctt 
tcgatgagat 
ccgaacaatt 
ctgctgct cc 
tgt ggaat tt 
ccgcgat age 
ataccctcct 
ctgcttctgc 
ggaaggtgct 
cctttaaggt 
ctatcctctc 
acgtgggccc 
ggggtaacca 
ctcagatcct 
acgaggactg 
gcacggtgtt 
gagtcccctt 
tgcaaaccac 
ggatcgtggg 
acaccacggg 
tggctgctga 
tgaccactga 
tggatggggt 
tcacattcct 
aaccggatgt 
cggctaagcg 
agctgt ctgc 
acct catcga 
agtcagaaaa 
agagggaagt 
t gcccat atg 
actacgtccc 
caccttcacg 
cggagctcgc 
caacggcctc 
actcctccat 
ggtctaccgt 
ggacaggcgc 
tgagcaact c 
gcctgcggca 
acgtgct caa 
aggaagcctg 
aggacgt ccg 
tgctggaaga 
gcgtccaacc 
gggttcgtgt 
tgatgggctc 
atgcctggaa 
caacggtcac 
cccccgaagc 
tgactaattc 
cgaccagctg 
cgaagctcca 
gcgcggggac 
actctgcccc 
gctcctccaa 
gtgaccccac 
attcctggct 
tgactcattt 
agatctacgg 
tccatggcct 
cttcatgcct 
gtgtccgcgc 
tcaactgggc 



gggagccgt t 

catgtcggct 

agctctggcc 

gtccggaaag 

ggaagagtgc 

caaacagaag 

cgtggt ggaa 

catcagcggg 

atcactgat g 

gt tt aacat c 

tttcgtaggc 

tgtggatatt 

catgagcggc 

ccctggcgcc 

aggggagggg 

cgtctccccc 

ctctagtctt 

ct ccacgcca 

gactgatttc 

cttctcatgt 

ctgcccatgt 

gcct aggacc 

CGcctgcacg 

ggagtacgtg 

cgacgtaaag 

gcggttgcac 

ggtcgggctc 

agcagtgctc 

taggctggcc 

gccttccttg 

ggccaacct c 

taaggtagta 

at ccgttccg 

ggcacgcccg 

tccagtggta 

gaggaagagg 

cacagagacc 

tcctgaccag 

gccccccctt 

aagcgaggag 

cctgatcacg 

tttgctccgt 

gaagaaggtc 

ggagatgaag 

taagctgacg 

gaacGtatcc 

cactgagaca~ 

agagaagggg 

gtgcgagaaa 

ttcatacgga 

agcgaagaaa 

tgagaatgac 

cagacaggcc 

taaagggcag 

cggtaatacc 

ggactgcacg 

ccaagaggac 

ccctggggac 

tgtgtcagtc 

cacccccctt 

aggcaacatc 

cttctccatc 

ggcctgttac 

tagcgcattt 

caggaaactt 

taggctactg 

agtaaggacc 



caaaacgagg 
gacctggagg 
gcgtattgcc 
ccggccat ca 
gcct cacacc 
gcaatcgggt 
tccaagtggc 
at acaatatt 
gcat t cacag 
ctggggggat 
gccggcatcg 
ttggcaggtt 
gagatgccct 
ctagtcgtcg 
gctgtgcagt 
acgcactatg 
accatcactc 
tgctccggct 
aagacctggc 
caacgtgggt 
ggggcacaga 
tgtagtaaca 
ccctccccgg 
gaggttacgc 
tgcccgtgtc 
aggtacgctc 
aatcaatacc 
acttccatgc 
aggggatctc 
aaggcaacat 
ctgtggcggc 
attttggact 
gcggagatcc 
gattacaacc 
cacgggtgtc 
acggttgtcc 
ttcggcagct 
ccctccgacg 
gagggggagc 
gctagtgagg 
ccatgcgctg 
caccacaact 
acctttgaca 
gcgaaggcgt 
cccccacatt 
agcaaggccg 
ccaattgaca 
ggccgcaagc 
atggcccttt 
ttccaatact 
tgccctatgg 
atccgtgttg 
ataaggtcgc 
aactgcggct 
ctcacatgtt 
atgctcgtat 
gaggcgagcc 
ccgcccaaac 
gcgcacgatg 
gcgcgggctg 
at catgtatg 
ctt ctagctc 
tccattgagc 
tcactccata 
ggggtaccgc 
tcccaggggg 
aagctcaaac 



ttactaccac 
t cgtcacgag 
tgacaacagg 
ttcccgacag 
tcccttacat 
tgctgcaaac 
ggaccat cga 
tagcaggctt 
cct ct at cac 
gggtggccgc 
ctggagcggc 
atggagcagg 
ccaccgagga 
gggtcgtgtg 
ggatgaaccg 
tgcctgagag 
agctgctgaa 
cgtggctaag 
t ccagt ccaa 
acaagggagt 
tcaccggaca 
cgtggcatgg 
cgccaaatta 
gggtggggga 
aggttccggc 
cagcgtgcaa 
tggttgggtc 
tcaccgaccc 
ctcccccctt 
gcactacccg 
aggagatggg 
ctttcgagcc 
tgcggaggt c 
ctccactgtt 
cat tgccgcc 
tgtcagaatc 
ccgaatcgtc 
acggcgacgc 
cgggggatcc 
acgtcgtctg 
cggaggaaac 
tggtctatgc 
gactgcaggt 
ccacagttaa 
cggccagatc 
ttaaccacat 
ccaccat cat 
cagctcgcct 
acgatgtggt 
ctcctggaca 
gcttcgcata 
aggagtcaat 
tcacagagcg 
atcgccggtg 
acttgaaggc 
gcggagacga 
tacgggcctt 
cagaatacga 
catctggcaa 
cgtgggagac 
cgcccacctt 
aggaacaact 
cacttgacct 
gttactctcc 
ccttgcgagt 
ggagggctgc 
tcactccaat 



acaccccata 
cacctgggtg 
cagcgtggtc 
ggaagtcctt 
cgaacaggga 
agccaccaag 
agcct tctgg 
gtccactctg 
cagcccgctc 
ccaacttgct 
tgttggcagc 
ggtggcaggc 
cctggttaac 
cgcagcgata 
gctgatagcg 
cgacgctgca 
gaggcttcac 
agat gtttgg 
gctcctgccg 
ctggcggggc 
tgtgaaaaac 
aacattcccc 
ttctagggcg 
tttccactac 
ccccgaattc 
acccctccta 
acagctccca 
ctcccacatt 
ggccagctca 
tcatgactcc 
cgggaacatc 
gctccaagcg 
caggaaattc 
agagtcct gg 
tgccaaggcc 
taccgtgtct 
ggccgtcgac 
gggatccgac 
cgatctcagc 
ctgctcgatg 
caagctgccc 
tacaacatct 
cctggacgac 
ggctaaactt 
taaatttggc 
ccgct ccgtg 
ggcaaaaaat 
tatcgtattc 
ctccaccctc 
gcgggtcgag 
tgacacccgc 
ctaccaatgt 
gctttacatc 
ccgcgcgagc 
cgctgcggcc 
ccttgtcgtt 
cacggaggct 
cttggagttg 
aagggtgtac 
agctagacac 
gtgggcaagg 
tgaaaaagcc 
acctcagatc 
aggtgagatc 
ctggagacat 
cacttgtggc 
cccggctgcg 



3660 
3720 
3780 
3840 

3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4 980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
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tcccagttgg atttatccag 
agcctgtctc gtgcccgacc 
gtaggcatct atctactccc 
ccatcctgtt tttttccctt 
ttttctcctt tttttttcct 
cctagtcacg gctagctgtg 
tggcctctct gcagatcaag 



ctggttcgtt gctggttaca 
ccgctggttc atgtggtgcc 
caaccgatga acggggagct 
tttttttttc tttttttttt 

ctttttttcc ttttctttcc 
aaaggtccgt gagccgcttg 
t 



gcgggggaga catatatcac 7680 
tactcctact ttctgtaggg 7740 
aaacactcca ggccaatagg 7800 
tttttttttt tttttttttt 7860 

tttggtggct ccatcttagc 7920 
actgcagaga gtgctgatac 7980 

8001 
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<210> 9 
<211> 11076 
<212> DNA 

<213> Hepatitis C Virus 



<400> 9 

gccagccccc 

tcttcacgca 

cccccctccc 

gacgaccggg 

gcgagactgc 

gtgcttgcga 

ctcaaagaaa 

acgcaggttc 

caatcggctg 

ttgtcaagac 

cgtggctggc 

gaagggactg 

ctcctgccga 

cggctacctg 

tggaagccgg 

ccgaact gtt 

atggcgatgc 

actgtggccg 

ttgctgaaga 

ctcccgattc 

cagaccacaa 

aacgttactg 

tccaccatat 

acgagcatt c 

gtgaaggaag 

tgcaggcagc 

taagat acac 

gaaagagt ca 

gtaccccatt 

tcgaggttaa 

cacgataata 

cgccgcccac 

ttgccgcgca 

caacctcgtg 

gctcagcccg 

ctcctgtcac 

cgcaatttgg 

attccgctcg 

gttctggagg 

ttccttttgg 

gtatccggag 

gcggacatga 

cgctgctggg 

acgatacgac 

gtgggagatc 

cggcacgaga 

cgtatggctt 

ttactccgga 

gcgggccttg 

ctctttgccg 

ct cgggatt a 

aacggcagct 

ttccttgctg 

gccagctgca 

tcacacagct 

gtacccgcgg 

gggacgaccg 

ctgcttctta 

agcactgggt 

aaaaccttga 



gat t gggggc 
gaaagcgtct 
gggagagcca 
tcctttcttg 
tagccgagta 
gtgccccggg 
aaccaaacgt 
tccggccgct 
ctctgatgcc 
cgacctgtcc 
cacgacgggc 
gctgct at tg 
gaaagtat cc 
cccatt cgac 
tcttgtcgat 
cgccaggctc 
ctgcttgccg 
gctgggtgtg 
gcttggcggc 
gcagcgcatc 
cggtttccct 
gccgaagccg 
tgccgtcttt 
ctaggggtct 
cagttcctct 
ggaacccccc 
ctgcaaaggc 
aatggct etc 
gtatgggatc 
aaaacgtcta 
ccatgggcac 
aggacgtcaa 
ggggccccag 
gaaggcgaca 
ggtacccctg 
cccgtggctc 
gtaaggtcat 
tcggcgcccc 
acggcgtgaa 
ctttgctgt c 
tgtaccatgt 
tcatgcatac 
tagcgctcac 
gccatgtcga 
tctgcggatc 
cagtacagga 
gggatatgat 
tcccacaagc 
cctactattc 
gcgttgacgg 
cgtccctctt 
ggcacat caa 
cgctgttcta 
gccccatcga 
cggaccagag 
cgcaggtgtg 
accggttcgg 
acaacacgcg 
tcaccaagac 
cctgccccac 



gacact ccac 

agccatggcg 

t agt ggtct g 

gatcaacccg 

gtgttgggtc 

aggtctcgta 

aacaccaacg 

tgggtggaga 

gccgtgttcc 

ggtgccctga 

gt tcctt gcg 

ggcgaagtgc 

at catggctg 

caccaagcga 

caggatgat c 

aaggcgcgca 

aatatcatgg 

gcggaccgct 

gaatgggctg 

gccttctatc 

ctagcgggat 

cttggaataa 

tggcaatgtg 

ttcccctctc 

ggaagcttct 

acct ggcgac 

ggcacaaccc 

ctcaagcgta 

tgatctgggg 

ggccccccga 

gaat cct aaa 

gtt cccgggc 

gttgggtgtg 

acctatcccc 

gcccctctat 

tcggcctagt 

cgataccctc 

cctagggggc 

ctatgcaaca 

ctgtt tgacc ' 

cacgaacgac 

ccccgggtgc 

tcccacgctc 

tttgctcgtt 

tgttttcctc 

ctgcaattgc 

gatgaactgg 

tgtcgtggat 

catggtgggg 

gggaacctat 

ttcacccggg 

caggactgcc 

cgtgcacaag 

cgcgttcgct 

gccttattgt 

tggtccagtg 

cgtccctacg 

gccgccgcaa 

gtgcgggggc 

ggactgcttc 



catagatcac 
ttagtatgag 
cggaaccggt 
ct caatgcct 
gcgaaaggcc 
gaccgtgcac 
ggcgcgccat 
ggctattcgg 
ggctgtcagc 
atgaactgca 
cagctgtgct 
cggggcagga 
atgcaatgcg 
aacatcgcat 
tggacgaaga 
t gcccgacgg 
tggaaaatgg 
atcaggacat 
accgcttcct 
gccttcttga 
caattccgcc 
ggccggtgtg 
agggcccgga 
gccaaaggaa 
tgaagacaaa 
aggtgcctct 
cagtgccacg 
t t caacaagg 
cctcggtgca 
accacgggga 
cctcaaagaa 
ggtggtcaga 
cgcgcgacta 
aaggctcgcc 
ggcaatgagg 
tggggcccca 
acgtgcggct 
gctgccaggg 
gggaatctgc 
'atcccagctt 
tgctccaacg 
gtgccctgcg 
gcggccagga 
ggggcggctg 
gtcgcccagc 
tcaatatatc 
tcacctacag 
atggtggcgg 
aactgggcta 
gtgacagggg 
tcatcccaga 
ctgaactgca 
ttcaactcat 
caggggtggg 
tggcactacg 
tactgctt ca 
tacagttggg 
ggcaactggt 
cccccgtgta 
cggaagcacc 



tcccctgtga 
tgtcgtgcag 
gagtacaccg 
ggagatttgg 
ttgtggtact 
catgagcacg 
gattgaacaa 
ctatgactgg 
gcaggggcgc 
ggacgaggca 
cgacgttgtc 
tctcctgtca 
gcggctgcat 
cgagcgagca 
gcatcagggg 
cgaggatct c 
ccgcttttct 
agcgttggct 
cgtgctttac 
cgagttcttc 
cctctccctc 
cgtttgtcta 
aacctggccc 
tgcaaggtct 
caacgtctgt 
gcggccaaaa 
ttgtgagttg 
ggctgaagga 
catgctttac 
cgtggttttc 
aaaccaaacg 
tcgtcggtgg 
ggaagacttc 
agcccgaggg 
gcttggggtg 
cggacccccg 
tcgccgatct 
ccctggcgca 
ccggttgctc 
ccgcttatga 
caagcattgt 
ttcgggagaa 
acgctagcgt 
ctctctgctc 
tgttcacctt 
ccggccacgt 
cagccctagt 
gggcccattg 
aggttctgat 
ggacgatggc 
aaatccagct 
atgactccct 
ctggatgccc 
ggcccatcac 
caccccggcc 
ccccaagccc 
gggagaatga 
ttggctgtac 
acatcggggg 
ccgaggccac 



ggaactactg 

cctccaggac 

gaattgccag 

gcgtgccccc 

gcctgatagg 

aatcctaaac 

gatggattgc 

gcacaacaga 

ccggttcttt 

gcgcggctat 

actgaagcgg 

tctcaccttg 

acgcttgat c 

cgtact cgga 

ctcgcgccag 

gtcgtgaccc 

ggat tcatcg 

acccgtgat a 

ggtatcgccg 

tgagtttaaa 

ccccccccct 

tatgttattt 

tgtcttcttg 

gttgaatgtc 

agcgaccctt 

gccacgtgta 

gatagttgtg 

tgcccagaag 

atgtgtttag 

ctttgaaaaa 

taacaccaac 

agtttacctg 

cgagcggtcg 

tagggcctgg 

ggcaggatgg 

gcgtaggtcg 

catggggtac 

tggcgtccgg 

cttttctatc 

agtgcgcaac 

gtatgaggca 

caactcctcc 

ccccactacg 

cgctatgtac 

ctcgcctcgc 

gacaggtcac 

ggtatcgcag 

gggagtccta 

tgtgatgcta 

caaaaacacc 

tgtaaacacc 

caacactggg 

agagcgcatg 

ttacaatgag 

gtgcggtatc 

tgtcgtggtg 

gacggacgtg 

atggatgaat 

gatcggcaat 

ttacaccaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 
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tgtggttcgg ggccttggtt gacacccaga 
cactacccct gcactgtcaa ctttaccatc 
gagcacaggc tcgaagccgc atgcaattgg 
agggacagat cagagcttag cccgctgctg 
tgttccttca ccaccctacc ggctctgtcc 
gtggacgtac aatacctgta cggtataggg 
gagtatgtcc tgttgctctt ccttcttctg 
atgatgctgc tgatagctca agctgaggcc 
gcatccgtgg ccggggcgca tggcattctc 
tacatcaagg gcaggctggt ccctggggcg 
ctcctgctcc tgctggcgtt accaccacga 
tcgtgcggag gcgcggtttt cgtaggtctg 
ctgttcctcg ctaggctcat atggtggtta 
ttgcaagtgt ggatcccccc cctcaacgtt 
acgtgcgcga tccacccaga gctaatcttt 
ggtccactca tggtgctcca ggctggtata 
gggctcattc gtgcatgcat gctggtgcgg 
gctctcatga agttggccgc actgacaggt 
cgggactggg cccacgcggg cctacgagac 
tctgatatgg agaccaaggt tatcacctgg 
atcttgggcc tgcccgtctc cgcccgcagg 
agccttgaag ggcaggggtg gcgactcctc 
cgaggcctac ttggctgcat catcactagc 
ggggaggtcc aagtggtctc caccgcaaca 
gtgtgttgga ctgtctatca tggtgccggc 
atcacccaaa tgtacaccaa tgtggaccag 
gcgcgttcct tgacaccatg cacctgcggc 
gccgatgtca ttccggtgcg ccggcggggc 
cccgtctcct acttgaaggg ctcttcgggc 
gtgggcatct ttcgggctgc cgtgtgcacc 
cccgtcgagt ctatggaaac cactatgcgg 
ccggccgtac cgcagacatt ccaggtggcc 
agcactaagg tgccggctgc gtatgcagcc 
tccgtcgccg ccaccctagg tttcggggcg 
aacatcagaa tcggggtaag gaccatcacc 
ggcaagtttc ttgccgacgg tggttgctct 
gagtgccact caactgactc gaccactatc 
gagacggctg gagcgcgact cgtcgtgctc 
gtgccacatc caaacatcga ggaggtggct 
ggcaaagcca tccccatcga gaccatcaag 
aagaagaaat gtgatgagct cgccgcgaag 
tattaccggg gccttgatgt atccgtcata 
acggacgctc taatgacggg ctttaccggt 
tgtgtcaccc agacagtcga cttcagcctg 
gtgccacaag acgcggtgtc acgctcgcag 
ggcatttaca ggtttgtgac tccaggagaa 
ctgtgcgagt gctatgacgc gggctgtgct 
gttaggttgc gggcttacct aaacacacca 
ttctgggaga gcgtctttac aggcctcacc 
aagcaggcag gagacaactt cccctacctg 
gctcaggctc cacctccatc gtgggaccaa 
acgctgcacg ggccaacgcc cctgctgtat 
accacacacc ccataaccaa atacatcatg 
acgagcacct gggtgctggt aggcggagtc 
acaggcagcg tggtcattgt gggcaggatc 
gacagggaag tcctttaccg ggagttcgat 
tacatcgaac agggaatgca gctcgccgaa 
caaacagcca ccaagcaagc ggaggctgct 
atcgaagcct tctgggcgaa gcatatgtgg 
ggcttgtcca ctctgcctgg caaccccgcg 
atcaccagcc cgctcaccac ccaacatacc 
gccgcccaac ttgctcctcc cagcgctgct 
gcggctgttg gcagcatagg ccttgggaag 
gcaggggtgg caggcgcgct cgtggccttt 
gaggacctgg ttaacctact ccctgctatc 
gtgtgcgcag cgatactgcg tcggcacgtg 
aaccggctga tagcgttcgc ttcgcggggt 



tgcttggtcc actacccata caggctttgg 3660 
ttcaaggtta ggatgtacgt ggggggagtg 3720 
actcgaggag agcgttgtaa cctggaggac 3780 
ctgtctacaa cggagtggca ggtattgccc 3840 
actggtttga tccatctcca tcagaacgtc 3900 
tcggcggttg tctcctttgc aatcaaatgg 3960 
gcggacgcgc gcgtctgtgc ctgcttgtgg 4020 
gccctagaga acctggtggt cctcaacgcg 4080 
tccttcctcg tgttcttctg tgctgcctgg 4140 
gcatatgccc tctacggcgt atggccgcta 4200 
gcatacgcca tggaccggga gatggcagca 4260 
atactcttga ccttgtcacc gcactataag 4320 
caatatttta tcaccagggc cgaggcacac 4380 
cgggggggcc gcgatgccgt catcctcctc 4440 
accatcacca aaatcttgct cgccatactc 4500 
accaaagtgc cgtacttcgt gcgcgcacac 4560 
aaggttgctg ggggtcatta tgtccaaatg 4620 
acgtacgttt atgaccatct caccccactg 4680 
cttgcggtgg cagttgagcc cgtcgtcttc 4740 
ggggcagaca ccgcggcgtg tggggacatc 4800 
gggagggaga tacatctggg accggcagac 4860 
gcgcctatta cggcctactc ccaacagacg 4920 
ctcacaggcc gggacaggaa ccaggtcgag 4 980 
caatctttcc tggcgacctg cgtcaatggc 5040 
tcaaagaccc ttgccggccc aaagggccca 5100 
gacctcgtcg gctggcaagc gccccccggg 5160 
agctcggacc tttacttggt cacgaggcat 5220 
gacagcaggg ggagcctact ctcccccagg 5280 
ggtccactgc tctgcccctc ggggcacgct 5340 
cgaggggttg cgaaggcggt ggactttgta 5400 
tccccggtct tcacggacaa ctcgtcccct 5460 
catctacacg cccctactgg tagcggcaag 5520 
caagggtata aggtgcttgt cctgaacccg 5580 
tatatgtcta aggcacatgg tatcgaccct 5640 
acgggtgccc ccatcacgta ctccacctat 5700 
gggggcgcct atgacatcat aatatgtgat 57 60 
ctgggcatcg gcacagtcct ggaccaagcg 5820 
gccaccgcta cgcctccggg atcggtcacc 5880 
ctgtccagca ctggagaaat ccccttttat 5940 
gggggg^ggc acctcatttt ctgccattcc 6000 
ctgtccggcc tcggactcaa tgctgtagca 6060 
ccaactagcg gagacgtcat tgtcgtagca 6120 
gacttcgact cagtgatcga ctgcaataca 6180 
gacccgacct tcaccattga gacgacgacc 6240 
cggcgaggca ggactggtag gggcaggatg 6300 
cggccctcgg gcatgttcga ttcctcggtt 6360 
tggtacgagc tcacgcccgc cgagacctca 6420 
gggttgcccg tctgccagga ccatctggag 6480 
cacatagacg cccatttctt gtcccagact 6540 
gtagcatacc aggctacggt gtgcgccagg 6600 
atgtggaagt gtctcatacg gctaaagcct 6660 
aggctgggag ccgttcaaaa cgaggttact 6720 
gcatgcatgt cggctgacct ggaggtcgtc 6780 
ctagcagctc tggccgcgta ttgcctgaca 6840 
atcttgtccg gaaagccggc catcattccc 6900 
gagatggaag agtgcgcctc acacctccct 6960 
caattcaaac agaaggcaat cgggttgctg 7020 
gctcccgtgg tggaatccaa gtggcggacc 7080 
aatttcatca gcgggataca atatttagca 7140 
atagcatcac tgatggcatt cacagcctct 7200 
ctcctgttta acatcctggg gggatgggtg 7260 
tctgctttcg taggcgccgg catcgctgga 7320 
gtgcttgtgg atattttggc aggttatgga 7380 
aaggtcatga gcggcgagat gccctccacc 74 40 
ctctcccctg gcgccctagt cgtcggggtc 7500 
ggcccagggg agggggctgt gcagtggatg 7560 
aaccacgtct cccccacgca ctatgtgcct 7620 
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gagagcgacg 
ctgaagaggc 
ctaagagatg 
tccaagctcc 
ggagtctggc 
ggacatgtga 
catggaacat 
aattatt eta 
ggggat t t cc 
ccggcccccg 
tgcaaacccc 
gggtcacagc 
gacccctccc 
cccttggcca 
acccgtcatg 
atgggcggga 
gagccgctcc 
aggtccagga 
ctgttagagt 
ccgcctgcca 
gaatctaccg 
tcgtcggccg 
gacgcgggat 
gatcccgatc 
gtctgctgct 
gaaaccaagc 
tatgctacaa 
caggtcctgg 
gttaaggcta 
agatctaaat 
cacat ccgct 
atcatggcaa 
cgccttatcg 
gtggtctcca 
ggacagcggg 
gcatatgaca 
tcaat ctacc 
gagcggcttt 
cggtgccgcg 
aaggccgctg 
gacgaccttg 
gccttcacgg 
tacgacttgg 
ggcaaaaggg 
gagacagcta 
accttgtggg 
caacttgaaa 
gacctacctc 
tctccaggtg 
cgagtctgga 
gctgccactt 
ccaatcccgg 
ggagacatat 
ctactttctg 
ctccaggcca 
tttttttttt 
tggctccatc 
agagagtgct 



ctgcagcacg 
ttcaccagtg 
t ttgggat tg 
t gccgcgatt 
ggggcgacgg 
aaaacggt tc 
t ccccatt aa 
gggcgct gt g 
actacgtgac 
aat tctt cac 
tcctacggga 
tcccatgcga 
acattacggc 
gctcatcagc 
actccccgga 
acatcacccg 
aagcggagga 
aat t ccctcg 
cctggaagga 
aggcccct cc 
tgtcttctgc 
t cgacagcgg 
ccgacgt tga 
t cagcgacgg 
cgatgtccta 
tgcccatcaa 
catctcgcag 
acgaccacta 
aacttctatc 
ttggctatgg 
ccgtgtggaa 
aaaatgaggt 
tattcccaga 
ccctccct ca 
t cgagtt cct 
cccgctgttt 
aatgttgtga 
acatcggggg 
cgagcggtgt 
cggcctgtcg 
tcgttatctg 
aggctatgac 
agttgataac 
tgtactatct 
gacacact cc 
caaggatgat 
aagccctaga 
agat cattca 
agatcaatag 
gacatcgggc 
gtggcaagta 
ctgcgtccca 
atcacagcct 
taggggtagg 
ataggccatc 
tttttttttc 
tt agccctag 
gatactggcc 



tgtcactcag 

gatcaacgag 

gatatgcacg 

gccgggagtc 

cat cat gcaa 

catgaggatc 

cgcgtacacc 

gcgggtggct 

gggcatgacc 

agaagtggat 

ggaggtcaca 

gcccgaaccg 

ggagacggct 

tagccagctg 

cgctgacctc 

cgtggagt ca 

ggatgagagg 

agcgatgccc 

cccggactac 

gataccacct 

cttggcggag 

cacggcaacg 

gtcgtactcc 

gtcttggtct 

cacatggaca 

tgcactgagc 

cgcaagcctg 

ccgggacgtg 

cgtggaggaa 

ggcaaaggac 

ggacttgctg 

t ttctgcgtc 

tttgggggtt 

ggccgtgatg 

ggtgaatgcc 

tgactcaacg 

cttggccccc 

ccccctgact 

actgacgacc 

agctgcgaag 

tgaaagcgcg 

tagatactct 

atcatgctcc 

cacccgtgac 

agtcaat t cc 

cctgatgact 

ttgt cagatc^ 

acgactccat 

ggtggcttca 

cagaagtgtc 

cctcttcaac 

gttggattta 

gtctcgtgcc 

catctatcta 

ctgttttttt 

tccttttttt 

t cacggctag 

tctctgcaga 



atcctctcta 
gactgctcca 
gtgttgactg 
cccttcttct 
accacctgcG 
gtggggccta 
acgggcccct 
gctgaggagt 
act gacgacg 
ggggtgcggt 
tt cctggtcg 
gatgtagcag 
aagcgtaggc 
tctgcgcctt 
atcgaggcca 
gaaaataagg 
gaagtatccg 
atatgggcac 
gt ccctccag 
tcacggagga 
ctcgccacag 
gcctctcctg 
tccatgcccc 
accgtaagcg 
ggcgccctga 
aactctttgc 
cggcagaaga 
ctcaaggaga 
gcctgtaagc 
gtccggaacc 
gaagacactg 
caaccagaga 
cgtgtgtgcg 
ggctcttcat 
tggaaagcga 
gtcactgaga 
gaagccagac 
aattctaaag 
agctgcggta 
ctccaggact 
gggacccaag 
gccccGcctg 
tccaatgtgt 
cccaccaccc 
tggctaggca 
catttcttct 
tacggggcct 
ggccttagcg 
tgcctcagga 
cgcgctaggc 
tgggcagtaa 
tccagctggt 
cgaccccgct 
ctccccaacc 
cccttttttt 
ttcctctttt 
ctgtgaaagg 
tcaagt 



gt cttaccat 
cgccatgctc 
atttcaagac 
catgtcaacg 
catgtggggc 
ggacctgtag 
gcacgccct c 
acgtggaggt 
t aaagtgccc 
t gcacaggt a 
ggctcaatca 
tgctcacttc 
tggccagggg 
ccttgaaggc 
acctcctgtg 
tagtaatttt 
ttccggcgga 
gcccggatt a 
tggtacacgg 
agaggacggt 
agaccttcgg 
accagccctc 
cccttgaggg 
aggaggctag 
tcacgccatg 
tccgtcacca 
aggtcacctt 
tgaaggcgaa 
tgacgccccc 
tatccagcaa 
agacaccaat 
aggggggccg 
agaaaatggc 
acggattcca 
agaaatgccc 
atgacatccg 
aggccataag 
ggcagaactg 
ataccctcac 
gcacgatgct 
aggacgaggc 
gggacccgcc 
cagtcgcgca 
cccttgcgcg 
acat catcat 
ccatccttct 
gttactccat 
cattttcact 
aacttggggt 
tactgtccca 
ggaccaagct 
tcgttgctgg 
ggttcatgtg 
gatgaacggg 
ttttcttttt 
tttccttttc 
tccgtgagcc 



cactcagctg 

cggctcgtgg 

ctggctccag 

tgggtacaag 

acagatcacc 

t aacacgt gg 

cccggcgcca 

tacgcgggtg 

gtgt caggtt 

cgct ccagcg 

atacctggtt 

catgctcacc 

atctcctccc 

aacatgcact 

gcggcaggag 

ggactctttc 

gatcctgcgg 

caaccct cca 

gtgtccattg 

tgtcctgtca 

cagctccgaa 

cgacgacggc 

ggagccgggg 

tgaggacgtc 

cgctgcggag 

caacttggtc 

tgacagactg 

ggcgtccaca 

acattcggcc 

ggccgttaac 

tgacaccacc 

caagccagct 

cctttacgat 

atactct cct 

tatgggctt c 

tgttgaggag 

gtcgctcaca 

cggctatcgc 

atgttacttg 

cgtatgcgga 

gagcctacgg 

caaaccagaa 

cgatgcatct 

ggctgcgtgg 

gtatgcgccc 

agctcaggaa 

tgagccactt 

ccatagttac 

accgcccttg 

gggggggagg 

caaactcact 

ttacagcggg 

gtgcctactc 

gagctaaaca 

tttttttttt 

tttcctttgg 

gcttgactgc 



7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11076 
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<210> 10 
<211> 8001 
<212> DNA 

<213> Hepatitis C Virus 

<400> 10 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 480 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900 
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440 
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560 
taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 1620 
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatggcgcc tattacggcc tactcccaac agacgcgagg cctacttggc 1860 
tgcatcatca ctagcctcac aggccgggac aggaaccagg tcgaggggga ggtccaagtg 1920 
gtctccaccg caacacaatc tttcctggcg acctgcgtca atggcgtgtg ttggactgtc 1980 
tatcatggtg ccggctcaaa gacccttgcc ggcccaaagg gcccaatcac ccaaatgtac 2040 
accaatgtgg accaggacct cgtcggctgg caagcgcccc ccggggcgcg ttccttgaca 2100 
ccatgcacct gcggcagctc ggacctttac ttggtcacga ggcatgccga tgtcattccg 2160 
gtgcgccggc ggggcgacag cagggggagc ctactctccc ccaggcccgt ctcctacttg 2220 
aagggctctt cgggcggtcc actgctctgc ccctcggggc acgctgtggg catctttcgg 2280 
gctgccgtgt gcacccgagg ggttgcgaag gcggtggact ttgtacccgt cgagtctatg 2340 
ggaaccacta tgcggtcccc ggtcttcacg'^gacaact cgt cccctccggc cgtaccgcag 2400 
acattccagg tggcccatct acacgcccct actggtagcg gcaagagcac taaggtgccg 2460 
gctgcgtatg caggccaagg gtataaggtg cttgtcctga acccgtccgt cgccgccacc 2520 
ctaggtttcg gggcgtatat gtctaaggca catggtatcg accctaacat cagaatcggg 2580 
gtaaggacca tcaccacggg tgcccccatc acgtactcca cctatggcaa gtttcttgcc 2640 
gacggtggtt gctctggggg cgcctatgac atcataatat gtgatgagtg ccactcaact 2700 
gactcgacca ctatcctggg catcggcaca gtcctggacc aagcggagac ggctggagcg 2760 
cgactcgtcg tgctcgccac cgctacgcct ccgggatcgg tcaccgtgcc acatccaaac 2820 
atcgaggagg tggctctgtc cagcactgga gaaatcccct tttatggcaa agccatcccc 2880 
atcgagacca tcaagggggg gaggcacctc attttctgcc attccaagaa gaaatgtgat 2940 
gagctcgccg cgaagctgtc cggcctcgga ctcaatgctg tagcatatta ccggggcctt 3000 
gatgtatccg tcataccaac tagcggagac gtcattgtcg tagcaacgga cgctctaatg 3060 
acgggcttta ccggcgactt cgactcagtg atcgactgca atacatgtgt cacccagaca 3120 
gtcgacttca gcctggaccc gaccttcacc attgagacga cgaccgtgcc acaagacgcg 3180 
gtgtcacgct cgcagcggcg aggcaggact ggtaggggca ggatgggcat ttacaggttt 3240 
gtgactccag gagaacggcc ctcgggcatg ttcgattcct cggttctgtg cgagtgctat 3300 
gacgcgggct gtgcttggta cgagctcacg cccgccgaga cctcagttag gttgcgggct 3360 
tacctaaaca caccagggtt gcccgtctgc caggaccatc tggagttctg ggagagcgtc 3420 
tttacaggcc tcacccacat agacgcccat ttcttgtccc agactaagca ggcaggagac 3480 
aacttcccct acctggtagc ataccaggct acggtgtgcg ccagggctca ggctccacct 3540 
ccatcgtggg accaaatgtg gaagtgtctc atacggctaa agcctacgct gcacgggcca 3600 
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acgcccctgc tgtataggct gggagccgtt 
accaaataca tcatggcatg catgtcggct 
ctggtaggcg gagtcctagc agctctggcc 
attgtgggca ggatcatctt gtccggaaag 
taccgggagt tcgatgagat ggaagagtgc 
atgcagctcg ccgaacaatt caaacagaag 
caagcggagg ctgctgctcc cgtggtggaa 
gcgaagcata tgtggaattt catcagcggg 
cctggcaacc ccgcgatagc atcactgatg 
accacccaac ataccctcct gtttaacatc 
Gctcccagcg ctgcttctgc tttcgtaggc 
ataggccttg ggaaggtgct tgtggatatt 
gcgctcgtgg cctttaaggt catgagcggc 
ctactccctg ctatcctctc ccctggcgcc 
ctgcgtcggc acgtgggccc aggggagggg 
ttcgcttcgc ggggtaacca cgtctccccc 
gcacgagtca ctcagatcct crctagtctt 
cagtggatca acgaggactg ctccacgcca 
gattggatat gcacggtgtt gactgatttc 
cgattgccgg gagtcccctt cttctcatgt 
gacggcatca tgcaaaccac ctgcccatgt 
ggttccatga ggatcgtggg gcctaggacc 
attaacgcgt acaccacggg cccctgcacg 
ctgtggcggg taggtgctga ggagtacgtg 
gtgacgggca tgaccactga caacgtaaag 
ttcacagaag tggatggggt gcggttgcac 
cgggaggagg tcacattcct ggtcgggctc 
tgcgagcctg aaccggatgt agcagtgctc 
acggcggaga cggctaagcg taggctggcc 
tcagctagcc agctgtctgc gccttccttg 
ccggacgctg acctcatcga ggccaacctc 
acccgcgtgg agtcagaaaa taaggtagta 
gaggaggatg agaggggagt atccgttccg 
cctcgagcga tgcccatatg ggcacgcccg 
aaggacccgg actacgtccc tccagtggta 
cctccgatac caccttcacg gaggaagagg 
tctgccttgg cggagctcgc cacagagacc 
agcggcacgg caacggcctc tcctgaccag 
gttgagtcgt actcctccat gccccccctt 
gacgggtctt ggtctaccgt aagcgaggag 
tcctacacat ggacaggcgc cctgatcacg 
atcaatgcac tgagcaactc tttgctccgt 
cgcagcgcaa acctgcggca gaagaaggtc 
cactaccggg acgtgctcaa ggagatgaag 
ctatccgtgg aggaagcctg taagctgacg 
tatggggcaa aggacgtccg gaacctatcc 
tggaaggact tgctggaaga cactgagaca" 
gaggttttct gcgtccaacc agagaagggg 
ccagatttgg gggttcgtgt gtgcgagaaa 
cctcaggccg tgatgggctc ttcatacgga 
ttcctggtga atgcctggaa agcgaagaaa 
tgttttgact caacggtcac tgagaatgac 
tgtgacttgg cccccgaagc cagacaggcc 
gggggccccc tgactaattc taaagggcag 
ggtgtactga cgaccagctg cggtaatacc 
tgtcgagctg cgaagctcca ggactgcacg 
atctgtgaaa gcgcggggac ccaagaggac 
atgactagat actctgcccc ccctggggac 
ataacatcat gctcctccaa tgtgtcagtc 
tatctcaccc gtgaccccac cacccccctt 
actccagtca attcctggct aggcaacatc 
atgatcctga tgactcattt cttctccatc 
ctagattgtc agatctacgg ggcctgttac 
attcaacgac tccatggcct tagcgcattt 
aatagggtgg cttcatgcct caggaaactt 
cgggccagaa gtgtccgcgc taggctactg 
aagtacctct tcaactgggc agtaaggacc 



caaaacgagg ttactaccac acaccccata 3660 
gacctggagg tcgtcacgag cacctgggtg 3720 
gcgtattgcc tgacaacagg cagcgtggtc 3780 
ccggccatca ttcccgacag ggaagtcctt 3840 
gcctcacacc tcccttacat cgaacaggga 3900 
gcaatcgggt tgctgcaaac agccaccaag 3960 
tccaagtggc ggaccatcga agccttctgg 4020 
atacaatatt tagcaggctt gtccactctg 4080 
gcattcacag cctctatcac cagcccgctc 4140 
ctggggggat gggtggccgc ccaacttgct 4200 
gccggcatcg ctggagcggc tgttggcagc 4260 
ttggcaggtt atggagcagg ggtggcaggc 4320 
gagttgccct ccaccgagga cctggttaac 4380 
ctagtcgtcg gggtcgtgtg cgcagcgata 4440 
gctgtgcagt ggatgaaccg gctgatagcg 4 500 
acgcactatg tgcctgagag cgacgctgca 4560 
accatcactc agctgctgaa gaggcttcac 4620 
tgctccggct cgtggctaag agatgtttgg 4680 
aagacctggc tccagtccaa gctcctgccg 4740 
caacgtgggt acaagggagt ctggcggggc 4800 
ggagcacaga tcaccggaca tgtgaaaaac 4860 
tgtagtaaca cgtggcatgg aacattcccc 4920 
ccctccccgg cgccaaatta ttctagggcg 4980 
gaggttacgc gggtggggga tttccactac 5040 
tgcccgtgtc aggttccggc ccccgaattc 5100 
aggtacgctc cagcgtgcaa acccctccta 5160 
aatcaatacc tggttgggtc acagctccca 5220 
acttccatgc tcaccgaccc ctcccacatt 5280 
^gggg^tctc cccccccctt ggccagctca 5340 
aaggcaacat gcactacccg tcatgactcc 5400 
ctgtggcggc aggagatggg cgggaacatc 54 60 
attttggact ctttcgagcc gctccaagcg 5520 
gcggagatcc tgcggaggtc caggaaattc 5580 
gattacaacc ctccactgtt agagtcctgg 5640 
cacgggtgtc cattgccgcc tgccaaggcc 5700 
acggttgtcc tgtcagaatc taccgtgtct 5760 
ttcggcagct ccgaatcgtc ggccgtcgac 5820 
ccctccgacg acggcgacgc gggatccgac 5880 
gagggggagc cgggggatcc cgatctcagc 5940 
gctagtgagg acgtcgtctg ctgctcgatg 6000 
ccatgcgctg cggaggaaac caagctgccc 6060 
caccacaact tggtctatgc tacaacatct 6120 
acctttgaca gactgcaggt cctggacgac 6180 
gcgaaggcgt ccacagttaa ggctaaactt 6240 
cccccacatt cggccagatc taaatttggc 6300 
agcaaggccg ttaaccacat ccgctccgtg 6360 
'ccaattgaca ccaccatcat ggcaaaaaat 6420 
ggccgcaagc cagctcgcct tatcgtattc 6480 
atggcccttt acgatgtggt ctccaccctc 6540 
ttccaatact ctcctggaca gcgggtcgag 6600 
tgccctatgg gcttcgcata tgacacccgc 6660 
atccgtgttg aggagtcaat ctaccaatgt 6720 
ataaggtcgc tcacagagcg gctttacatc 6780 
aactgcggct atcgccggtg ccgcgcgagc 6840 
ctcacatgtt acttgaaggc cgctgcggcc 6900 
atgctcgtat gcggagacga ccttgtcgtt 6960 
gaggcgagcc tacgggcctt cacggaggct 7020 
ccgcccaaac cagaatacga cttggagttg 7080 
gcgcacgatg catctggcaa aagggtgtac 7140 
gcgcgggctg cgtgggagac agctagacac 7200 
atcatgtatg cgcccacctt gtgggcaagg 7260 
cttctagctc aggaacaact tgaaaaagcc 7320 
tccattgagc cacttgacct acctcagatc 7380 
tcactccata gttactctcc aggtgagatc 7440 
ggggtaccgc ccttgcgagt ctggagacat 7500 
tcccaggggg ggagggctgc cacttgtggc 75 60 
aagctcaaac tcactccaat cccggctgcg 7620 
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tcccagttgg atttatccag 
agcctgtctc gtgcccgacc 
gtaggcatct atctactccc 
ccatcctgtt tttttccctt 
ttttctcctt tttttttcct 
cctagtcacg gctagctgtg 
tggcctctct gcagatcaag 



ctggttcgtt gctggttaca 
ccgctggttc atgtggtgcc 
caaccgatga acggggagct 
tttttttttc tttttttttt 

ctttttttcc ttttctttcc 
aaaggtccgt gagccgcttg 
t 



gcgggggaga catatatcac 7680 
tactcctact ttctgtaggg 7740 
aaacactcca ggccaatagg 7800 
tttttttttt tttttttttt 7860 

tttggtggct ccatcttagc 7920 
actgcagaga gtgctgatac 7980 

8001 
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<210> 11 
<211> 11076 
<212> DNA 

<213> Hepatitis C Virus 
<400> 11 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 480 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900 
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440 
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560 
taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 1620 
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatgggcac gaatcctaaa cctcaaagaa aaaccaaacg taacaccaac 1860 
cgccgcccac aggacgtcaa gttcccgggc ggtggtcaga tcgtcggtgg agtttacctg 1920 
ttgccgcgca ggggccccag gttgggtgtg cgcgcgacta ggaagacttc cgagcggtcg 1980 
caacctcgtg gaaggcgaca acctatcccc aaggctcgcc agcccgaggg tagggcctgg 2040 
gctcagcccg ggtacccctg gcccctctat ggcaatgagg gcttggggtg ggcaggatgg 2100 
ctcctgtcac cccgtggctc tcggcctagt tggggcccca cggacccccg gcgtaggtcg 2160 
cgcaatttgg gtaaggtcat cgataccctc acgtgcggct tcgccgatct catggggtac 2220 
attccgctcg tcggcgcccc cctagggggc gctgccaggg ccctggcgca tggcgtccgg 2280 
gttctggagg acggcgtgaa ctatgcaaca gggaatctgc ccggttgctc cttttctatc 2340 
ttccttttgg ctttgctgtc ctgtttgacc* 'atcccagctt ccgcttatga agtgcgcaac 2400 
gtatccggag tgtaccatgt cacgaacgac tgctccaacg caagcattgt gtatgaggca 2460 
gcggacatga tcatgcatac ccccgggtgc gtgccctgcg ttcgggagaa caactcctcc 2520 
cgctgctggg tagcgctcac tcccacgctc gcggccagga acgctagcgt ccccactacg 2580 
acgatacgac gccatgtcga tttgctcgtt ggggcggctg ctctctgctc cgctatgtac 2640 
gtgggagatc tctgcggatc tgttttcctc gtcgcccagc tgttcacctt ctcgcctcgc 2700 
cggcacgaga cagtacagga ctgcaattgc tcaatatatc ccggccacgt gacaggtcac 2760 
cgtatggctt gggatatgat gatgaactgg tcacctacag cagccctagt ggtatcgcag 2820 
ttactccgga tcccacaagc tgtcgtggat atggtggcgg gggcccattg gggagtccta 2880 
gcgggccttg cctactattc catggtgggg aactgggcta aggttctgat tgtgatgcta 2940 
ctctttgccg gcgttgacgg gggaacctat gtgacagggg ggacgatggc caaaaacacc 3000 
ctcgggatta cgtccctctt ttcacccggg tcatcccaga aaatccagct tgtaaacacc 3060 
aacggcagct ggcacatcaa caggactgcc ctgaactgca atgactccct caacactggg 3120 
ttccttgctg cgctgttcta cgtgcacaag ttcaactcat ctggatgccc agagcgcatg 3180 
gccagctgca gccccatcga cgcgttcgct caggggtggg ggcccatcac ttacaatgag 3240 
tcacacagct cggaccagag gccttattgt tggcactacg caccccggcc gtgcggtatc 3300 
gtacccgcgg cgcaggtgtg tggtccagtg tactgcttca ccccaagccc tgtcgtggtg 3360 
gggacgaccg accggttcgg cgtccctacg tacagttggg gggagaatga gacggacgtg 3420 
ctgcttctta acaacacgcg gccgccgcaa ggcaactggt ttggctgtac atggatgaat 3480 
agcactgggt tcaccaagac gtgcgggggc cccccgtgta acatcggggg gatcggcaat 3540 
aaaaccttga cctgccccac ggactgcttc cggaagcacc ccgaggccac ttacaccaag 3600 
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tgtggttcgg ggccttggtt gacacccaga tgcttggtcc actacccata caggctttgg 3660 
cactacccct gcactgtcaa ctttaccatc ttcaaggtta ggatgtacgt ggggggagtg 3720 
gagcacaggc tcgaagccgc atgcaattgg actcgaggag agcgttgtaa cctggaggac 3780 
agggacagat cagagcttag cccgctgctg ctgtctacaa cggagtggca ggtattgccc 3840 
tgttccttca ccaccctacc ggctctgtcc actggtttga tccatctcca tcagaacgtc 3900 
gtggacgtac aatacctgta cggtataggg tcggcggttg tctcctttgc aatcaaatgg 3960 
gagtatgtcc tgttgctctt ccttcttctg gcggacgcgc gcgtctgtgc ctgcttgtgg 4020 
atgatgctgc tgatagctca agctgaggcc gccctagaga acctggtggt cctcaacgcg 4080 
gcatccgtgg ccggggcgca tggcattctc tccttcctcg tgttcttctg tgctgcctgg 4140 
tacatcaagg gcaggctggt ccctggggcg gcatatgccc tctacggcgt atggccgcta 4200 
ctcctgctcc tgctggcgtt accaccacga gcatacgcca tggaccggga gatggcagca 4260 
tcgtgcggag gcgcggtttt cgtaggtctg atactcttga ccttgtcacc gcactataag 4320 
ctgttcctcg ctaggctcat atggtggtta caatatttta tcaccagggc cgaggcacac 4380 
ttgcaagtgt ggatcccccc cctcaacgtt cgggggggcc gcgatgccgt catcctcctc 4440 
acgtgcgcga tccacccaga gctaatcttt accatcacca aaatcttgct cgccatactc 4500 
ggtccactca tggtgctcca ggctggtata accaaagtgc cgtacttcgt gcgcgcacac 4560 
gggctcattc gtgcatgcat gctggtgcgg aaggttgctg ggggtcatta tgtccaaatg 4620 
gctctcatga agttggccgc actgacaggt acgtacgttt atgaccatct caccccactg 4680 
cgggactggg cccacgcggg cctacgagac cttgcggtgg cagttgagcc cgtcgtcttc 4740 
tctgatatgg agaccaaggt tatcacctgg ggggcagaca ccgcggcgtg tggggacatc 4800 
atcttgggcc tgcccgtctc cgcccgcagg gggagggaga tacatctggg accggcagac 4860 
agccttgaag ggcaggggtg gcgactcctc gcgcctatta cggcctactc ccaacagacg 4920 
cgaggcctac ttggctgcat catcactagc ctcacaggcc gggacaggaa ccaggtcgag 4980 
ggggaggtcc aagtggtctc caccgcaaca caatctttcc tggcgacctg cgtcaatggc 5040 
gtgtgttgga ctgtctatca tggtgccggc tcaaagaccc ttgccggccc aaagggccca 5100 
atcacccaaa tgtacaccaa tgtggaccag gacctcgtcg gctggcaagc gccccccggg 5160 
gcgcgttcct tgacaccatg cacctgcggc agctcggacc tttacttggt cacgaggcat 5220 
gccgatgtca ttccggtgcg ccggcggggc gacagcaggg ggagcctact ctcccccagg 5280 
cccgtctcct acttgaaggg ctcttcgggc ggtccactgc tctgcccctc ggggcacgct 5340 
gtgggcatct ttcgggctgc cgtgtgcacc cgaggggttg cgaaggcggt ggactttgta 5400 
cccgtcgagt ctatgggaac cactatgcgg tccccggtct tcacggacaa ctcgtcccct 5460 
ccggccgtac cgcagacatt ccaggtggcc catctacacg cccctactgg tagcggcaag 5520 
agcactaagg tgccggctgc gtatgcaggc caagggtata aggtgcttgt cctgaacccg 5580 
tccgtcgccg ccaccctagg tttcggggcg tatatgtcta aggcacatgg tatcgaccct 5640 
aacatcagaa tcggggtaag gaccatcacc acgggtgccc ccatcacgta ctccacctat 5700 
ggcaagtttc ttgccgacgg tggttgctct gggggcgcct atgacatcat aatatgtgat 5760 
gagtgccact caactgactc gaccactatc ctgggcatcg gcacagtcct ggaccaagcg 5820 
gagacggctg gagcgcgact cgtcgtgctc gccaccgcta cgcctccggg atcggtcacc 5880 
gtgccacatc caaacatcga ggaggtggct ctgtccagca ctggagaaat ccccttttat 5940 
ggcaaagcca tccccatcga gaccatcaag ggggggaggc acctcatttt ctgccattcc 6000 
aagaagaaat gtgatgagct cgccgcgaag ctgtccggcc tcggactcaa tgctgtagca 6060 
tattaccggg gccttgatgt atccgtcata ccaactagcg gagacgtcat tgtcgtagca 6120 
acggacgctc taatgacggg ctttaccggc gacttcgact cagtgatcga ctgcaataca 6180 
tgtgtcaccc agacagtcga cttcagcctg gacccgacct tcaccattga gacgacgacc 6240 
gtgccacaag acgcggtgtc acgctcgcag cggcgaggca ggactggtag gggcaggatg 6300 
ggcatttaca ggtttgtgac tccaggagaa cggccctcgg gcatgttcga ttcctcggtt 6360 
ctgtgcgagt gctatgacgc gggctgtgct^^tggtacgagc tcacgcccgc cgagacctca 6420 
gttaggttgc gggcttacct aaacacacca gggttgcccg tctgccagga ccatctggag 6480 
ttctgggaga gcgtctttac aggcctcacc cacatagacg cccatttctt gtcccagact 6540 
aagcaggcag gagacaactt cccctacctg gtagcatacc aggctacggt gtgcgccagg 6600 
gctcaggctc cacctccatc gtgggaccaa atgtggaagt gtctcatacg gctaaagcct 6660 
acgctgcacg ggccaacgcc cctgctgtat aggctgggag ccgttcaaaa cgaggttact 6720 
accacacacc ccataaccaa atacatcatg gcatgcatgt cggctgacct ggaggtcgtc 6780 
acgagcacct gggtgctggt aggcggagtc ctagcagctc tggccgcgta ttgcctgaca 6840 
acaggcagcg tggtcattgt gggcaggatc atcttgtccg gaaagccggc catcattccc 6900 
gacagggaag tcctttaccg ggagttcgat gagatggaag agtgcgcctc acacctccct 6960 
tacatcgaac agggaatgca gctcgccgaa caattcaaac agaaggcaat cgggttgctg 7020 
caaacagcca ccaagcaagc ggaggctgct gctcccgtgg tggaatccaa gtggcggacc 7080 
atcgaagcct tctgggcgaa gcatatgtgg aatttcatca gcgggataca atatttagca 7140 
ggcttgtcca ctctgcctgg caaccccgcg atagcatcac tgatggcatt cacagcctct 7200 
atcaccagcc cgctcaccac ccaacatacc ctcctgttta acatcctggg gggatgggtg 7260 
gccgcccaac ttgctcctcc cagcgctgct tctgctttcg taggcgccgg catcgctgga 7320 
gcggctgttg gcagcatagg ccttgggaag gtgcttgtgg atattttggc aggttatgga 7380 
gcaggggtgg caggcgcgct cgtggccttt aaggtcatga gcggcgagtt gccctccacc 7440 
gaggacctgg ttaacctact ccctgctatc ctctcccctg gcgccctagt cgtcggggtc 7500 
gtgtgcgcag cgatactgcg tcggcacgtg ggcccagggg agggggctgt gcagtggatg 7560 
aaccggctga tagcgttcgc ttcgcggggt aaccacgtct cccccacgca ctatgtgcct 7620 
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gagagcgacg ctgcagcacg agtcactcag atcctctcta gtcttaccat cactcagctg 7680 
ctgaagaggc ttcaccagtg gatcaacgag gactgctcca cgccatgctc cggctcgtgg 7740 
ctaagagatg tttgggattg gatatgcacg gtgttgactg atttcaagac ctggctccag 7800 
tccaagctcc tgccgcgatt gccgggagtc cccttcttct catgtcaacg tgggtacaag 7860 
ggagtctggc ggggcgacgg catcatgcaa accacctgcc catgtggagc acagatcacc 7920 
ggacatgtga aaaacggttc catgaggatc gtggggccta ggacctgtag taacacgtgg 7980 
catggaacat tccccattaa cgcgtacacc acgggcccct gcacgccctc cccggcgcca 8040 
aattattcta gggcgctgtg gcgggtaggt gctgaggagt acgtggaggt tacgcgggtg 8100 
ggggatttcc actacgtgac gggcatgacc actgacaacg taaagtgccc gtgtcaggtt 8160 
ccggcccccg aattcttcac agaagtggat ggggtgcggt tgcacaggta cgctccagcg 8220 
tgcaaacccc tcctacggga ggaggtcaca ttcctggtcg ggctcaatca atacctggtt 8280 
gggtcacagc tcccatgcga gcctgaaccg gatgtagcag tgctcacttc catgctcacc 8340 
gacccctccc acattacggc ggagacggct aagcgtaggc tggccagggg atctcccccc 8400 
cccttggcca gctcatcagc tagccagctg tctgcgcctt ccttgaaggc aacatgcact 8460 
acccgtcatg actccccgga cgctgacctc atcgaggcca acctcctgtg gcggcaggag 8520 
atgggcggga acatcacccg cgtggagtca gaaaataagg tagtaatttt ggactctttc 8580 
gagccgctcc aagcggagga ggatgagagg ggagtatccg ttccggcgga gatcctgcgg 8640 
aggtccagga aattccctcg agcgatgccc atatgggcac gcccggatta caaccctcca 8700 
ctgttagagt cctggaagga cccggactac gtccctccag tggtacacgg gtgtccattg 8760 
ccgcctgcca aggcccctcc gataccacct tcacggagga agaggacggt tgtcctgtca 8820 
gaatctaccg tgtcttctgc cttggcggag ctcgccacag agaccttcgg cagctccgaa 8880 
tcgtcggccg tcgacagcgg cacggcaacg gcctctcctg accagccctc cgacgacggc 8940 
gacgcgggat ccgacgttga gtcgtactcc tccatgcccc cccttgaggg ggagccgggg 9000 
gatcccgatc tcagcgacgg gtcttggtct accgtaagcg aggaggctag tgaggacgtc 9060 
gtctgctgct cgatgtccta cacatggaca ggcgccctga tcacgccatg cgctgcggag 9120 
gaaaccaagc tgcccatcaa tgcactgagc aactctttgc tccgtcacca caacttggtc 9180 
tatgctacaa catctcgcag cgcaaacctg cggcagaaga aggtcacctt tgacagactg 9240 
caggtcctgg acgaccacta ccgggacgtg ctcaaggaga tgaaggcgaa ggcgtccaca 9300 
gttaaggcta aacttctatc cgtggaggaa gcctgtaagc tgacgccccc acattcggcc 9360 
agatctaaat ttggctatgg ggcaaaggac gtccggaacc tatccagcaa ggccgttaac 9420 
cacatccgct ccgtgtggaa ggacttgctg gaagacactg agacaccaat tgacaccacc 9480 
atcatggcaa aaaatgaggt tttctgcgtc caaccagaga aggggggccg caagccagct 9540 
cgccttatcg tattcccaga tttgggggtt cgtgtgtgcg agaaaatggc cctttacgat 9600 
gtggtctcca ccctccctca ggccgtgatg ggctcttcat acggattcca atactctcct 9660 
ggacagcggg tcgagttcct ggtgaatgcc tggaaagcga agaaatgccc tatgggcttc 9720 
gcatatgaca cccgctgttt tgactcaacg gtcactgaga atgacatccg tgttgaggag 9780 
tcaatctacc aatgttgtga cttggccccc gaagccagac aggccataag gtcgctcaca 9840 
gagcggcttt acatcggggg ccccctgact aattctaaag ggcagaactg cggctatcgc 9900 
cggtgccgcg cgagcggtgt actgacgacc agctgcggta ataccctcac atgttacttg 9960 
aaggccgctg cggcctgtcg agctgcgaag ctccaggact gcacgatgct cgtatgcgga 10020 
gacgaccttg tcgttatctg tgaaagcgcg gggacccaag aggacgaggc gagcctacgg 10080 
gccttcacgg aggctatgac tagatactct gccccccctg gggacccgcc caaaccagaa 10140 
tacgacttgg agttgataac atcatgctcc tccaatgtgt cagtcgcgca cgatgcatct 10200 
ggcaaaaggg tgtactatct cacccgtgac cccaccaccc cccttgcgcg ggctgcgtgg 10260 
gagacagcta gacacactcc agtcaattcc tggctaggca acatcatcat gtatgcgccc 10320 
accttgtggg caaggatgat cctgatgact catttcttct ccatccttct agctcaggaa 10380 
caacttgaaa aagccctaga ttgtcagatc -tacggggcct gttactccat tgagccactt 10440 
gacctacctc agatcattca acgactccat ggccttagcg cattttcact ccatagttac 10500 
tctccaggtg agatcaatag ggtggcttca tgcctcagga aacttggggt accgcccttg 10560 
cgagtctgga gacatcgggc cagaagtgtc cgcgctaggc tactgtccca gggggggagg 10620 
gctgccactt gtggcaagta cctcttcaac tgggcagtaa ggaccaagct caaactcact 10680 
ccaatcccgg ctgcgtccca gttggattta tccagctggt tcgttgctgg ttacagcggg 10740 
ggagacatat atcacagcct gtctcgtgcc cgaccccgct ggttcatgtg gtgcctactc 10800 
ctactttctg taggggtagg catctatcta ctccccaacc gatgaacggg gagctaaaca 10860 
ctccaggcca ataggccatc ctgttttttt cccttttttt ttttcttttt tttttttttt 10920 
tttttttttt tttttttttc tccttttttt ttcctctttt tttccttttc tttcctttgg 10980 
tggctccatc ttagccctag tcacggctag ctgtgaaagg tccgtgagcc gcttgactgc 11040 
agagagtgct gatactggcc tctctgcaga tcaagt 11076 
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CLAIMS 

A hepatitis C virus (HCV) cell culture system, which 
comprises mainly eukaryotic cells containing 
transfected HCV specific genetic material, 
characterized in that, 

the eukaryotic cells are human hepatoma cells and the 
transfected HCV specific genetic material is a HCV-RNA 
construct, which comprises the HCV specific RNA 
segments 5' NTR, NS3, NS4A, NS4B, NS5A, NS5B, and 
3' NTR as well as an additional marker gene for 
selection (selection gene) . 

A cell culture system according to Claim 1, 
characterized in that, 

the hepatoma cells are derived from a usual hepatoma 
cell line- 



A cell culture system 
characterized in that, 
the hepatoma cells are 
hepatoma cell culture. 



according to Claim 1, 
extracted from a primary 



A HCV-RNA construct, characterized in that, 
it comprises the HCV specific RNA segments 5' NTR, 
NS3, NS4A, NS4B, NS5A, NS5B, and 3' NTR as well as an 
additional marker gene for selection (selection gene) . 

A HCV-RNA construct according to Claim 4, 
characterized in that, 

it comprises one of the nucleotide sequences 
illustrated in the sequence protocols SEQ ID NO: 1 to 
SEQ ID NO: 11. 



A HCV-RNA construct according to Claim 4, 
characterized in that. 



CA 02303526 2000-03-31 



the 3' NTR has a nucleotide sequence, which has been 
selected from the group of nucleotide sequences (a) to 
(i) listed in the following: 

( a ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTA 
GCTTTTTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGCC 
CAGACTGGAGT 

( b ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTA 
GTCTTTTTTTTTTCTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGCCCA 
GACTGGAGC 

( c ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTAAT 
^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 

GACTGGAGC 

(d) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTT 
AGTCTTTTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGC 
CCAGACTGGAGT 

( e ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTA 
GTCTTTTTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGC 
CCAGACTGGAGT 

(f ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTA 
GTCTTTTTTTTTTTTCTTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTG 
CCCAGACTGGAGT 

( g ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTA 
GTCTTTTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGCC 
CAGACTGGAGT 

( h ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTT 
TAATCTTTTTTTTTTTTTTCCTTTTTTTGAGAGAGAGAGTCTCACTCTGT 
TGCCCAGACTGGAGT 
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( i ) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTT 
7\ATCTTTTTTTTTTTTTTCTTTTTTTTTTGAGAGAGAGAGTCTCACTCTG 
TTGCCCAGACTGGAGT 



5 7 . A HCV-RNA construct according to one of the Claims 4 
to 6, characterized in that, 

the marker gene for selection is a resistance gene and 
preferably an antibiotic resistance gene. 

10 8 . A HCV-RNA construct according to one of the Claims 4 
to 6, characterized in that, 

the marker gene for selection is a neomycin 
phosphotransferase gene . 

15 9. A HCV-RNA construct according to one of the Claims 4 
to 8^ characterized in that, 

the marker gene for selection is integrated in the 
HCV-RNA downstream from the 5' NTR. 

20 10. A HCV-RNA construct according to one of the Claims 4 
to 83, characterized in that, 

the marker gene for selection is linked to the HCV-RNA 
via a ribozyme or a recognition sequence for a 
ribozyme . 

25 

11. A HCV-RNA construct according to one of the Claims 4 
to 10, characterized in that, 
it has an integrated reporter gene. 

30 12. A HCV-RNA construct according to Claim 11, 
characterized in that, 

the reporter gene is a gene from the group of the 
luciferase genes, the CAT gene (chloramphenicol acetyl 
transferase gene) , the lacZ gene (beta galactosidase 
35 gene) , the GFP gene (green fluorescence protein gene) , 

the GUS gene (glucuronidase gene) and the SEAP gene 
(secreted alkaline phosphatase gene) . 
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13. A HCV-RNA construct according to one of the Claims 4 

to 11, characterized in that, 

their replication has an influence on the expression 
5 of a (cellular) surrogate marker gene. 

14- A HCV-RNA construct according to one of the Claims 11 
to 13, characterized in that, 

the resistance gene is cloned into the open reading 
10 frame of the HCV-RNA in such a way that it will only 

be transferred to an active form after proteolytic 
processing . 

15. A HCV-RNA construct according to one of the Claims 11 
15 to 14, characterized in that, 

the reporter gene and the marker gene for selection 
are arranged in the construct in such a way, that they 
are expressed as a fusion protein. 

20 16. A cell culture system according to one of the Claims 1 
to 3, characterized in that, 

the HCV-RNA construct is a construct according to at 
least one of the Claims 4 to 15. 

25 17- A cell culture system according to Claim 1, 
characterized in that, 

the cells containing the HCV-RNA construct are lodged 
at the DSMZ, Braunschweig, Germany under the deposit 
number DSM ACC2394 (laboratory name HuBl 9-13) . 

30 

18. A use of a cell culture system according to one of the 
Claims 1 to 3 or 16 to 17 and/or a HCV-RNA construct 
according to one of the Claims 4 to 15 for the 
production and/or evaluation and/or testing of 
35 treatments and/or diagnostics for the treatment of HCV 

infections in particular. 
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A use of a cell culture system according to one of the 
Claims 1 to 3 or 16 to 17 and/or a HCV-RNA construct 
according to one of the Claims 4 to 15 for the 
production of a vaccine against HCV infections. 

A use a HCV-RNA construct according to one of the 
Claims 4 to 15 for the production of a liver cell 
specific vector in gene therapy. 

A HCV-RNA construct according to one of the Claims 4 
to 15, characterized in that, 

it has an integrated foreign gene and can be used to 
transfect this foreign gene in a target cell, which 
can be used to express this foreign gene. 

A process for the production of cell culture adapted 
mutants of a HCV-RNA construct according to one of the 
claims 4 to 15, in which the mutants have increased 
replication efficiency compared to the HCV-RNA 
construct, 

characterized in that, 

a cell culture system according to claim 1, in which 
the transfected HCV specific genetic material is a 
HCV-RNA construct with a selection gene according to 
one of the claims 4 to 15, is cultivated on/in the 
selection medium corresponding to the selection gene, 
that the cell clones created are collected, and that 
the HCV-RNA constructs or parts thereof are isolated 
from these cell clones . 

A process according to claim 22, characterized in 
that, 

the isolated HCV-RNA constructs are passaged at least 
one more time. That is, they are transfected in cells 
of a cell culture system according to claim 1. The 
cell culture system of claim 1 thereby obtained, in 
which the transfected HCV specific genetic material is 
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the isolated HCV-RNA construct with a selection gene, 
is cultivated on/in the selection medium corresponding 
to the selection gene, the cell clones created are 
collected, and the HCV-RNA constructs are isolated 
from these cell clones. 

A process for the production of mutants of a HCV full- 
length genome or of a HCV subgenome or any HCV 
construct with increased replication efficiency in 
comparison to the original HCV full-length genome or 
subgenome or HCV-RNA construct, 
characterized in that, 

a cell culture adapted mutant of a HCV-RNA construct 
is produced and isolated by a process according to 
claim 22 or 23, 

the nucleotide and amino acid sequence of these 
mutants is determined and the type, number and 
positions of the nucleotide and amino acid mutations 
is determined by comparison with the nucleotide and 
amino acid sequence of the original HCV-RNA construct, 
and these mutations are introduced in an (isolated) 
HCV full-length genome or a HCV subgenome or any HCV- 
RNA construct, either by site-directed mutagenesis or 
by exchange of fragments containing the relevant 
mutations 

A cell culture adapted HCV-RNA construct with high 
replication efficiency, 
characterized in that, 

it is derivable from a HCV-RNA construct according to 
one of the claims 4 to 15 through nucleotide and/or 
amino acid mutations and that it is obtainable with a 
process according to one of the claims 22 to 24. 

A cell culture adapted HCV-RNA construct according to 

claim 25, 

characterized in that. 
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it carries one or several of the following amino acid 
exchanges: 1283 arg gly and/or 1383 glu ala 

and/or 1577 lys -> arg and/or 1609 lys -> glu and/or 
1936 pro ser and/or 2163 glu gly and/or 2330 lys 
-> glu and/or 2442 ile -> val. 



A cell culture adapted HCV-RNA construct according to 
claim 25 or 26, 
characterized in that, 

it displays one or several of the nucleotide and/or 
amino acid exchanges shown in Table 3, where Table 3 
is an integral part of this claim. 



Cell culture adapted mutant of a HCV-RNA construct or 
of a HCV full-length genome with increased replication 
efficiency in comparison to the original HCV-RNA 
construct or original HCV full-length genome, 
characterized in that, 

it is obtainable with a process in which the number 
and type of mutations in a cell culture adapted HCV- 
RNA construct according to claim 24 are determined 
through sequence analysis and sequence comparison, and 
these mutations are introduced in a HCV-RNA construct, 
particularly in a HCV-RNA construct according to one 
of the claims 4 to 15, or in an (isolated) HCV-RNA 
full-length genome, either by targeted mutagenesis or 
by exchange of sequence sections containing the 
relevant mutations. 



Hepatitis C virus particles or virus-like particles, 
characterized in that, 

they are obtainable by a process according to one of 
the claims 22-24. 



Cells infected with Hepatitis C virus particles 
virus-like particles, according to claim 29. 
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